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CHAPTER  I 


INTRODUCTION 

An  ever-grov?ing  dependence  upon  sophisticated  tech- 
nology for  success  in  military  endeavors  has  intensified 
the  importance  of  reliability.  At  the  same  time,  con- 
tinuing increases  in  modern  weapon  system  complexity  have 
made  the  achievement  of  this  reliability  extremely  diffi- 
cult. Intuitively,  equipment  with  a high  degree  of  relia- 
bility implies  that  the  equipment  will  operate  longer  with- 
out failure  and  will  have  a greater  likelihood  of  being 
ready  for  use  when  needed.  More  technically,  reliability 
is  defined  in  terms  of  probability.  Specifically,  relia- 
bility is  the  probability  that  a system,  component,  item, 
unit,  or  piece  of  equipment  will  adequately  perform  its 
intended  function  for  a required  interval  of  time  under 
expected  operating  conditions  (10:205). 

Terminology 

Because  of  the  many  technical  terms  and  mathemat- 
ical concepts  utilized  in  discussing  the  complex  nature  of 
reliability  theory,  a list  of  definitions  is  inserted  early 
in  the  thesis  for  reader  familiarization.  This  initial 
introduction  of  definitions  will  be  reinforced  by  expanded 
explanations  of  many  of  the  terms  as  they  occur  throughout 
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the  research  effort.  This  reinforcement  technique  is  used 
to  carefully  guide  the  reader  through  the  technical  discus- 
sions required  in  the  thesis.  Terminology  given  will  be 
for  the  purpose  of  clarity  and  consistency  within  this 
thesis . 


Aggregated  data.  Data  points  which  have  been 
grouped  together  from  two  or  more  samples  or  populations  so 
that  the  initial  source  is  no  longer  identifiable. 

Availability.  A measure  of  the  degree  to  which  a 
unit  is  operable  at  the  start  of  a mission  at  some  random 
point  in  time. 

Burn-in.  The  early  operation  of  a unit  to  stabi- 
lize its  operational  characteristics. 

» Catastrophic  failure.  A failure  which  occurs 

suddenly  without  warning  and  which  results  in  a complete 
failure  of  a unit. 

2 

Chi-square  test  (x  ) • A Goodness-Of-Fit  (G-O-F) 
test  for  determining  whether  or  not  a reliability  function 
based  on  observed  data  can  be  described  by  a specific 
theoretical  function. 
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Coir.por.ent/Item/Unit.  Interchangeable  terms  for  any 
part,  subassembly,  line  replaceable  unit,  system,  or  piece 
I of  equipment  whose  failure  and  replacement  must  be  consid- 

ered in  reliability  analysis. 

Component  population.  The  total  number  of  units  in 
a given  situation  from  which  samples  may  be  drav.-n. 

Constant  maintenance  policy.  A maintenance  policy 
which  dees  not  change  over  the  period  of  time  that  is  being 
considered. 

Convolution.  The  process  of  randomly  selecting 
values  from  different  samples  and  then  summing  these  values 
to  create  a new  random  variable. 

Debugging.  A reliability  procedure  used  to  detect 
and  eliminate  early  failures  and  thus  stabilize  the  oper- 
ation of  a unit. 

Exponential  distribution.  The  exponential  variation 
of  the  probability  of  occurrence  of  a variable  with  respect 
to  a change  in  tim.e. 

Failure . The  inability  of  a unit  to  perform  ade- 
quately; i.e.,  the  opposite  of  "operating  normally."  For 
purposes  of  this  research,  a unit  has  only  twe  states  of 
nature;  either  the  unit  is  operational,  or  the  unit  has 
failed. 
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iTailure  rate.  The  number  of  failures  cf  a unit  per 
interval  measure  of  time. 

I 

j Hazard  rate.  The  instantaneous  failure  rate  of  the 

components  in  a system,  under  consideration;  or,  the  limit 
of  the  failure  rate  as  the  interval  length  approaches  zero. 

Infant  m.ortality.  The  premature  catastrophic  fail- 
ure in  the  early  operating  life  of  a unit  which  occurs  at  a 
rate  substantially  greater  than  would  norm.ally  be  expected. 

Mean  Time  Between  Failure  (MTBF) . For  a given  time 
interval,  the  total  functioning  life  of  a population  of  a 
unit  divided  by  the  total  number  of  failures  within  the 
population  during  the  measurement  interval.  This  MTBF 
definition  is  applicable  only  to  the  exponential  distri- 
bution. For  other  statistical  distributions,  the  MTBF 
calculation  must  be  computed  on  an  individual  basis. 

Planned  life.  The  expected  operational  life  of  a 
unit  at  the  time  of  acquisition. 

1 Random  failure.  A failure  whose  occurrence  is 

predictable  only  in  a probabilistic  or  statistical  sense. 

Simulation.  A set  of  test  conditions  designed  to 
duplicate  operating  and  usage  environment  and  conditions 
as  closely  as  possible. 
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steady  state.  The  normal  expected  operating  state 
for  a given  unit.  For  the  exponential  case , the  steady 
state  is  the  condition  of  a constant  failure  (hazard)  rate . 

Wearout . An  attrition  process  which  results  in  an 
increase  of  the  failure  rate  with  increasing  age  (32:10). 

Statement  of  the  Problem 

The  Air  Force  implicitly  assumes  that  unit  failures 
follow  an  exponential  distribution  in  establishing  the  pro- 
cedures for  determining  support  requirements  based  on  the 
number  of  failures.  The  use  of  the  exponential  distri- 
bution implies  a constant  failure  rate  with  respect  to  time 
for  any  component  population.  A constant  failure  rate, 
which  is  also  referred  to  as  a constant  hazard  rate,  infers 
that  all  failures  occur  stochastically  (randomly) . There- 
fore, the  Air  Force's  use  of  exponential  distribution  with 
its  associated  constant  hazard  rate  implies  that  a unit's 
failure  is  as  apt  to  occur  after  the  first  hour  of  oper- 
ation as  it  is  to  occur  after  a thousand  hours  of  operation 
(26).  The  "exponential  assumption"  ignores,  or  assumes  to 
be  insignificant,  the  realities  of  infant  mortality  and/or 
wearout  (aging)  as  considerations  for  a unit's  failure. 
Instead,  emphasis  is  shifted  to  the  computational  simplic- 
ity of  the  use  of  the  exponential  distribution  by  designing 
failure  data  collection  systems  based  on  the  parameter, 
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Mean  Time  Between  Failures  (MTBF) , of  the  exponential 
distribution. 

I 

I Under  the  exponential  assumption  of  a constant 

failure  rate,  only  the  total  operating  time  and  the  total 
number  of  failures  are  needed  to  mathematically  compute  a 
point  estimate  of  the  MTBF  (35) . Many  current  data  col- 
lection systems  for  assessing  reliability  are  designed  only 
to  record  these  failure  data.  But  this  level  of  detail  is 
only  sufficient  for  the  determination  of  the  MTBF,  the  one 
parameter  that  describes  the  exponential  distribution. 
Therefore,  the  assumption  of  the  exponential  function  as 
describing  the  underlying  failure  model  based  on  this  com- 
bined or  aggregated  failure  data  is  made  out  of  sheer  neces- 
sity (28:3) . 

The  use  of  the  exponential  MTBF  formula  suggests  a 
unit's  immediate  entry  into  a steady  state  condition. 

Steady  state  is  a condition  in  which  failures  occur  at  a 
constant  rate;  i.e.,  the  condition  of  a constant  hazard 
rate.  Thus,  an  assumption  of  the  exponential  function  as  a 
model  for  describing  a unit's  failure  process  implies  that 
the  unit  always  operates  in  a steady  state  condition.  But, 
in  fact,  computer  simulation  models  indicate  that  the  time 
to  reach  steady  state  may  range  up  to  sixteen  years  under 
certain  maintenance  policies  (8:72-7;  11:53). 

The  problem  is  that  the  current  Air  Force  assumption 
of  a constant  failure  rate  nay  be  erroneous  until  a steady 
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state  is  actually,  if  ever,  attained.  If  so,  the  errors 
introduced  into  logistics  planning  result  in  misstated  sup- 
port requirements  and  thus  erroneous  cost  estimates.  These 
requirements  include  how  many  spare  parts  are  to  be  pur- 
chased, how  many  support  manhours  are  to  be  scheduled,  what 
support  facilities  are  to  be  made  available,  and  even  the 
number  of  units  that  are  required  to  be  purchased  to  sup- 
port a given  mission  (8). 

Justification  for  the  Research 

Today,  in  the  Air  Force,  the  total  cost  of  support 
for  a given  system  is  at  least  equal  to  the  initial  acqui- 
sition cost;  therefore,  a need  exists  to  constantly 
question  current  logistical  practices,  including  those 
involving  reliability.  The  Air  Force  needs  to  continually 
search  for  more  accurate,  economical  procedures  and  methods 
to  apply  to  the  attainment  of  a specified  mission.  With 
sky-rocketing  inflation  and  increasing  attention  by  Con- 
gress on  Air  Force  and  Department  of  Defense  expenditures, 
challenges  to  out-dated  ways  of  performing  specific 
functions  must  be  m.ade  to  provide  a strong  defense  posture 
at  the  lowest  total  cost. 

The  maintenance  of  a strong  defense  by  the  United 
States  requires  the  acquisition  of  the  best,  most  reliable 
weapon  systems  that  are  available.  But  if  the  capability 
to  obtain  these  weapon  systems  is  lessened  by  the  cost  of 
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the  weapon  systems'  design  and  associated  support  require- 
ments, ways  and  means  must  be  found  and  used  to  decrease 
these  costs  while  still  providing  effective  weapon  systems. 
Primary  justification  of  this  thesis  rests  on  the  premise 
that  the  exponential  assumption  of  component/system  fail- 
ures may  be  inappropriately  applied  in  many  situations  and, 
thus  influence  the  cost  of  the  design,  acquisition,  and 
support  of  modern  weapon  systems. 

This  thesis  challenges  the  common  practice  of 
assigning  that  unit/equipment  failures  are  generated  from  a 
stochastic  process  whose  underlying  distribution  is  expo- 
nential. If  the  exponential  assumption  is  indeed  inappro- 
priate, then  the  related  costs  of  making  such  an  assumption 
may  be  substantial. 

Management  needs  current,  accurate  information 
about  unit/equipment  failure  distributions  to  determine 
necessary  resource  requirements.  These  requirements 
include  planning  the  number  and  time  of  reordering  spares, 
anticipating  maintenance  and  other  support  manpower,  and 
other  considerations  during  the  planned  life  of  the  compo- 
nents/systems. In  this  age  of  spiraling  costs  for  these 
requirements  and  of  increased  demands  on  modern  weapon 
systems,  challenges  must  be  made  to  established  practices 
and  paradigms  in  a continual  search  for  the  best  way  to 
cope  with  increased  requirements  and  decreased  resources. 
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Military  Assumptions 


Air  Force  Reguration  (AFR)  80-5,  Reliability  and 
Maintainability  Programs  for  Systems,  Subsystems,  Equipment, 
and  Munitions,  states  that  reliability  influences  logistical 
requirements  and,  therefore,  is  an  essential  part  of  an 
effective  weapon  system  (37)  . Current  trends  toward  the 
acquisition  of  weapon  systems  based  on  life  cycle  cost 
criteria,  including  logistical  support,  dictate  that  reli- 
ability be  considered  throughout  the  system's  total  life. 

In  conjunction  with  these  reliability  considerations 
throughout  the  Air  Force,  AFR  80-5  states  that  adequate 
data  reporting  systems  will  be  provided  to  aid  management 
in  assessing  weapon  system  reliability. 

These  data  collection  systems  are  normally  based  on 
the  criteria  as  set  forth  by  the  military  standards 
(MIL-STD)  specified  by  the  DOD  procuring  agency  at  the  time 
of  the  systems'  acquisition.  MIL-STD-785,  Requirements  for 
Military  Programs  (for  Systems  and  Equipment /Development 
and  Production) , is  the  primary  reliability  publication 
used  in  the  preparation  of  specifications  and  contractural 
documents  by  the  DOD  for  contractor  reliability  programs 
(36:1).  This  publication  permits  the  contractor  to  assume 
that  equipment  failures  follow  the  exponential  distribution 
wherever  this  military  standard  is  referenced  in  the  con- 
tract (36  :8)  . 
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MIL-STD-7S1B , Reliability  Tests: 


ExDonential 


Distribution,  also  inplies  that  failures  are  exponentially 
distributed;  in  fact,  the  entire  military  standard  is 
fundamentally  based  on  this  assumption.  This  document  out- 
lines reliability  qualifications,  reliability  production 
acceptance,  and  longevity  tests  (35:1). 

MIL-STD-756,  Reliability  Prediction,  facilitates 
the  use  of  the  exponential  distribution  in  military  reli- 
ability prediction  (34:2,  5).  The  DOD  generally  assumes 
" ...  in  their  contracting  and  specification  reliability 
standard  that  the  single  most  important  parameter  is  the 
MTBF  i_8:15J."  The  parameter  MTBF  is  defined  in  MIL-STD-721B , 
Definitions  of  Effectiveness  Terms  for  Reliability,  Main- 
tainability, Human  Factors,  and  Safety,  as  " . . . the 
total  functioning  life  of  a population  of  an  item  divided 
by  the  total  number  of  failures  within  the  population  during 
the  measurement  interval  . . . [32:5]."^ 

This  standard  definition  of  MTBF  supports  the  use 
of  the  exponential  assumption  of  failures  in  the  develop- 
ment of  data  collection  systems  and  serves  to  encourage  the 
aggregation  of  failure  data.  Such  aggregated  data  gener- 
ally consist  of  only  the  total  operating  time  and  the  total 
number  of  failures.  Although  these  data  are  sufficient  for 

listing  of  other  documents  of  interest  used  by 
the  Air  Force  in  military  reliability  prediction  and  con- 
tractual specif ications  is  presented  in  Appendix  A. 
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evaluating  the  MTBF  parameter  of  the  er.ponential  distri- 
bution, they  are  useless  in  determining  the  parameters  of 
two-parameter  distributions  such  as  the  gamiria  and  Vveibull. 

Fortunately,  a few  data  collection  systems  main- 
tained by  depot  level  repair  facilities  do  not  aggregate 
data.  These  data  systems  record  operating  times,  the  num- 
ber of  failures  for  a given  unit,  and  other  data  that  are 
necessary  in  a failure  analysis.  When  data  are  collected 
in  this  manner,  parameters  can  be  calculated  for  one  or 
two  parameter  distributions.  The  one  parameter  exponential 
model  can  then  be  compared  with  two  parameter  models,  such 
as  the  gamma  and  Weibull,  to  determine  which  is  the  most 
appropriate  model  for  a given  set  of  failure  data. 

Scope  of  the  Research 

This  thesis  was  restricted  to  a literature  search 
and  the  analysis  of  failure  data  obtained  from  three  iner- 
tial measurement  units  (IMU) . Failure  data  were  analyzed 

2 

using  the  SIMFIT  computer  program.  Although  there  is  a 
comprehensive  set  of  distributions  for  the  purpose  of  fit- 
ting failure  .data  within  SIMFIT,  their  number  is  limited. 
While  other  distributions  which  could  possibly  model  the 
failure  data  better  than  the  ones  contained  in  SIMFIT,  the 

2 

SIMFIT  13  a computer  program  which  tests  data  to 
determine  if  the  data  follow  a theoretical  distribution. 

An  explanation  of  SIMFIT  is  contained  in  Appendix  3. 
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most  common  and  most  widely  used  model,  nam.ely  the  exponen- 
tial, is  contained  within  SIMFIT.  This  distribution  was  of 
primary  interest  throughout  the  research. 

An  additional  limitation  may  be  the  SIMFIT  computer 

2 

program's  use  of  the  nonparametric  Chi-square  (y  ) and  the 

Kolomogorov-Smirnov  (K-S)  Goodness-Of-Fit  (G-O-F)  tests. 

However,  these  nonparametric  curve  fitting  methods  reflect 

3 

the  current  state  of  the  art. 

Objective  of  the  Research 

The  objective  of  this  thesis  was  to  investigate  and 
com.pare  the  distribution  of  actual  failure  rates,  computed 
using  operational  field  data,  with  theoretical  distributions 
of  expected  failure  rates.  Failure  data  for  independently 
failing  units  were  collected  and  used  to  compute  param.eters 
for  field  data  to  compare  with  those  specified  in  the  con- 
tract at  the  time  of  unit  acquisition.  The  fact  that  these 
units  were  renewed  after  each  failure  had  to  be  considered 
during  the  evaluation  procedure. 

The  major  research  task  was  to  determine  if  the  expo- 
nential distribution  was  the  appropriate  statistical  m>odel 
for  each  renewal  cycle  in  the  life  of  the  population. 


K-S 


and 


^See  Appendix  C 
G-C-F  tests. 


a complete  discussion  of  the 
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Research  Hypothesis 

The  distribution  of  failures  for  some  replaceable/ 
repairable  units  follow  a statistical  distribution  which 
differs  from  the  exponential  distribution  model. 
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CHAPTER  II 


RELIABILITY  CONCEPTS  AND  METHODS 


As  components/systems  have  become  more  complex,  the 
applicable  reliability  concepts  (and  models)  have  become 
increasingly  more  sophisticated.  This  component/system 
complexity  has  increased  the  degree  and  number  of  mathe- 
matical concepts  needed  to  understand  modern  reliability 
theory.  These  reliability  concepts  require  a basic  under- 
standing of  probability  and  statistical  theory  because  both 
the  laws  of  probability  and  the  techniques  of  statistics 
are  needed  to  accurately  describe  and  model  failures  of 
given  units. “ 

This  chapter  describes  the  reliability  and  statis- 
tical concepts  that  are  necessary  to  understand  the 
research  effort.  First,  the  concept  of  a hazard  rate  is 
presented  with  a discussion  of  how  the  gamma  and  Weibull 
distributions  can  be  used  to  m.odel  different  hazard  rates. 
Next,  the  exponential  distribution  is  shown  to  be  a special 
case  of  the  Weibull  and  gamma  models  when  their  paramieters 
are  appropriately  chosen  to  represent  a constant  hazard 


■^A  development  of  the  probability  concepts  and  sta- 
tistical techniques  used  in  this  thesis  can  be  found  in 
most  reliability  texts  such  as  Shoom.an  (26). 


rate.  Following  the  discussion  of  the  exponential  model, 
the  concept  of  renewal  is  introduced.  Finally,  attention 
is  directed  to  the  SIMFIT  computer  program,  which  was  used 
to  evaluate  the  research  hypothesis  concerning  the  inappro- 
priate use  of  the  exponential  distribution  as  a model  for 
failure  data. 


Hazard  Rate 

One  of  the  most  useful  tools  in  the  study  of  relia- 
bility is  the  concept  of  a hazard  rate,  H(t),  defined  as 
the  conditional  probability  of  failure  in  the  time  interval 
(t,  t + At)  , given  that  the  unit  has  survived  up  to  tim.e  t. 
Stated  another  way,  H(t)  is  the  ratio  of  the  expected  num- 
ber of  failures  in  an  interval  of  time  to  the  number  of 
units  presently  operating  during  that  interval  of  time 
(13:83) . 

The  hazard  rate  is  mathematically  defined  as 


H(t) 


- lim 
At->0 


n (t)  - n (t  + At) 
n (t) At 


where  n(t)  = N • R(t),  and  N represents  the  number  of  units 
surviving  at  time  t.  At  represents  an  increment  of  time, 
and  R(t)  is  the  reliability  of  a unit  at  the  beginning  of 
the  interval  (t,  t + At) . A more  common  form  of  the  hazard 
rate  used  in  reliability  is 

f (t) 


H(t)  = 


R(t) 
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where  f (t)  is  the  probability  density  function  of  the  unit 
(26:182) . 


The  hazard  rate  is  an  instantaneous  failure  rate  of 
a unit  under  consideration.  If  it  can  be  shown  that  the 
failure  rate  for  a given  unit  is  independent  of  time,  then 
the  unit  is  said  to  have  a constant  hazard  rate.  One  reason 
for  the  use  of  the  hazard  rate,  particularly  the  constant 
hazard  rate,  is  that  it  allows  relatively  easy  computation 
and  easily  interpreted  graphs.  Figure  2.1a  depicts  the 
graphs  of  a constant  hazard  rate  (CHR) , as  well  as  those  of 
an  increasing  hazard  rate  (IHR)  and  a decreasing  hazard 
rate  (DHR) . 

A constant  hazard  rate  implies  that  the  probability 
of  failure  in  any  time  period  remains  constant  throughout 
a given  unit's  lifetime.  For  many  mechanical  units,  which 
generally  deteriorate  or  wear  out  over  time,  the  hazard 
rate  is  not  constant,  but  increasing.  On  the  other  hand, 
many  electronic  units/systems  often  exhibit  a decreasing 
hazard  rate  which  is  the  phenomenon  referred  to  as  infant 
mortality  (26:135-9).  For  a system  composed  of  units  which 
have  different  individual  hazard  rates,  a system  hazard 
rate  will  result.  The  system  hazard  rate  consisting  of 
first  a decreasing  hazard  rate,  then  a constant  hazard  rate, 
and  finally  an  increasing  hazard  rate  has  been  named  the 
"bath  tub"  curve  (Figure  2.1b)  (26:194). 
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Erickson  and  Hammond  have  shown  that  two  situations 


lead  to  the  use  of  a CHR.  The  first  case  occurs  when  the 
system  is  composed  of  units  which  all  have  constant  hazard 
rates.  The  other  case,  which  is  more  common,  occurs  when  a 
complex  system  with  components  having  different  hazard 
rates  achieves  steady  state  (11:24).  Various  studies  have 
shown  that  the  constant  hazard  rate  is  more  applicable  for 
a highly  complex  system  or  for  large  pools  of  equipment 
after  their  attainment  of  steady  state  conditions  (2:18-22; 
6:78;  19:49). 

For  an  example  of  a constant  hazard  rate,  suppose 
that  at  the  beginning  of  a given  time  period,  100  units  are 
operational.  After  some  time  period  (ten  hours) , the  units 
are  surveyed  and  only  90  units  are  found  to  be  operational 
(10  units  or  10%  have  failed) . If  the  units  have  a constant 
hazard  rate,  the  sam.e  percentage  of  the  remaining  opera- 
tional units  will  fail  within  the  next  tine  period  of  ten 
hours.  Therefore,  assuming  that  none  of  the  failed  units 
are  replaced,  one  would  expect  at  the  next  survey  of  units 
to  find  81  operational  units.  In  summary,  if  ten  out  of 
the  100  units  failed  in  the  time  interval  0 to  t^^,  and  if 
nine  units  out  of  the  remaining  90  units  failed  in  the  equal 
tine  interval  tj^  to  t2 , etc.,  then  the  units  exhibit  a con- 
stant hazard  rate. 


Weibull  Distribution 


The  fact  that  the  failures  of  a component  population 
may  exhibit  properties  of  the  "bath  tub"  curve  or  other  mix- 
tures of  hazard  rates  has  generated  a great  deal  of  interest 
about  the  Weibull  and  gamma  distributions,  within  the  world 
of  reliability.  The  Weibull  model  is  not  only  capable  of 
representing  a constant  hazard  rate  but  also  can  be  used  to 
model  an  increasing  hazard  rate  or  a type  of  decreasing 
hazard  rate  by  the  appropriate  selection  of  the  model  param- 
eters (26 ; 190 ) . 

The  Weibull  distribution  does  exhibit  two  drawbacks. 
First,  the  model  is  a two  parameter  model,  which  means  it 
is  more  difficult  to  estimate  the  parameters.  But  with  the 
availability  of  modern  computers  and  talented  programmers, 
this  drawback  should  be  minor.  The  second  drawback  is  with- 
in the  model  itself.  The  Weibull  model  cannot  accurately 
represent  the  linearly  decreasing  hazard  rate,  which  is 
useful  in  describing  early  failures.  "However,  with  an 
appropriate  choice  of  k and  m ;_the  model's  parameters]  one 
should  be  able  to  minimize  this  effect  [26:190J." 
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The  Weibull  density  function  is  mathematically 


defined  as 


f (t) 


kt  exp 


(m+1)  . 


2 

where  m and  k are  the  models'  parameters. 

In  the  Weibull  model,  if  m = 0,  the  resulting  dis- 
tribution is  the  exponential  distribution.  Therefore,  the 
exponential  distribution  can  be  thought  of  as  a special 
case  of  the  Weibull  distribution,  or  alternatively,  the 
Weibull  distribution  can  be  thought  of  as  a more  general 
case  of  the  exponential. 


Gamma  Distribution 

Another  failure  model  which  proved  to  be  important 
during  this  research  was  the  two  parameter  form  of  the 
gamma  distribution.  Similar  to  the  Weibull  model,  the 
gamma  distribution  has  the  ability  to  model  different 
hazard  rates,  including  the  constant  hazard  rate,  for  vari- 
ous component  populations. 

The  gamma  density  function  is  mathematically  defined 


as 


f(t)  = 


B r 


(a  + 1) 


exp 


For  additional  development  of  the  Weibull  distri- 
bution see  Appendix  D. 


where  a>-l,  £5>0,  0^t^<=°,  and  F (x)  = (x  - 1)!  , 
when  X is  an  integer  (23:4-37). 

If  a = 0,  the  exponential  distribution  is  obtained 
as  a special  case  of  the  gamma  distribution.  And,  if  a is 
defined  as  being  a positive  integer,  the  resulting  distri- 
bution is  called  the  Erlang  distribution.^ 

Exponential  Distribution 

Since  the  exponential  distribution  is  a special 
case  of  the  Weibull  or  gainma  distributions,  one  may  conjec- 
ture that  if  an  underlying  distribution  has  been  assumed 
exponential  by  an  oversimplification  of  reality,  then  a 
mors  appropriate  distribution  might  be  the  Weibull  or  gamma. 
But  the  fact  is,  however,  reliability  analysis  of  failures 
has  concentrated  on  the  concept  of  a constant  hazard  rate. 

To  assume  a component/system's  hazard  rate  is  constant 
implies  that  the  failures  of  the  component/system  follow 
an  exponential  distribution. 

A comprehensive  analysis  of  failure  distributions 
by  the  RAUD  Corporation  in  the  early  1950 's  discovered  that 
the  exponential  distribution  accurately  described  the  fail- 
ure characteristics  for  a wide  variety  of  devices  (17:209). 
The  list  of  applications  where  the  exponential  distribution 

^See  Appendix  D for  further  development  of  the 
gamma  and  Erlang  distributions  and  a summarized  discussion 
of  other  statistical  distributions  considered  important  to 
the  analysis  of  the  IMU  failure  data. 
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could  be  used  grew  rapidly,  but  there  were  argur-ents 
against  the  universal  use  of  the  exponential  distribution. 

The  exponential  law  is  reasonable  appropriate  where 
chance  alone  dictates  failure  occurrence.  If  it  is 
known,  for  example,  that  failure  is  consistently  due  to 
deterioration,  wearout,  degradation  fatigue,  or  any 
repetitive  mechanism,  almost  certain  nonexponentiality 
is  implied  ....  To  be  truly  random,  failure  cannot 
be  due  to  design  deficiencies  or  manufacturing  errors. 
Being  assignable  as  to  cause,  failure  distributions 
associated  with  early  life  of  a product  usually  are, 
not  exponential  in  that  they  are  not  random  L21:194J. 

And 

. . . it  is  to  be  emphasized  that  considerable  care 
is  required  in  the  selection  of  the  appropriate  under- 
lying distribution  for  reliability  testing.  The  valid- 
ity of  the  test  results  depends  to  a large  degree  upon 
how  well  the  selected  probability  distribution  repre- 
sents the  actual  distribution  of  the  time  to  failure 
upon  which  observations  are  being  made  ....  This 
point  is  emphasized  because  the  widespread  (perhaps 
indiscriminate)  use  of  the  exponential  distribution  as 
a model  of  failure  patterns  may  lead  one  to  believe 
that  failure  times  in  general  may  be  adequately  repre- 
sented by  such  a distribution  ....  [7:3-6j. 

The  exponential  distribution  is  a consequence  of 
the  assumption  that  the  probability  of  failure  in  a given 
time  interval  is  directly  proportional  to  the  length  of  the 
interval  and  is  independent  of  the  age  of  the  unit.  This 
assumption  means  that  individual  failures  occur  in  a random 
or  unpredictable  manner  and  are  not  caused  by  design  imper- 
fections (infant  mortality)  or  deterioration  (wearout) 
(26:24) . 
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he  exponential  density  function  is  mathematically 


defined  as 

f (t)  = 1 exp  L - It]  , t ^ 0 

and  the  reliability  function  for  the  distribution  is  given 
as 

R(t)  = expL-  '\tj, 

where  l/X  = 0 is  the  Mean  Time  to  Failure  (MTTF) . The 

expected  value  of  the  exponential  distribution,  E(t)  = 

2 

while  the  variance,  V(t)  = 0 and  the  standard  deviation  is 
equal  to  0.  A very  interesting  relationship  concerning  the 
exponential  parameter,  MTTF,  is  that  it  is  equal  to  the 
standard  deviation.  This  relationship  further  facilitates 
the  simplicity  in  using  the  exponential  distribution. 

The  MTTF  is  itself  a popular  measure  of  reliability 
as  is  the  MTBF. 

The  MTBF  has  meaning  only  when  one  is  discussing  a 
renewal  situation,  where  there  is  repair  or  replacement. 
. . . Unfortunately  these  two  quantities  (]mttf  & MTBF  i 
are  sometimes  w’rongly  thought  of  as  equivalent,  probablv 
because  for  certain  simple  constant-hazard  cases  they 
are  equal.  In  a single-parameter  distribution,  speci- 
fication of  the  MTTF  fixes  the  parameter  j]26:197]. 

In  multiple  parameter  distributions,  such  as  the  Weibull 

and  gamma,  the  MTBF  places  constraints  on  only  one  of  the 

model’s  parameters.  Whereas,  the  MTBF  completely  constrains 

the  single  parameter  exponential  distribution. 
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Renewal 


The  Mean  Time  Between  Failures  is  so  frequently 
encountered  in  reliability  that  an  understanding  of  the 
concept  of  renewal  or  repair  of  a unit  is  vital.  Physi- 
cally, renewal  implies  that  when  a unit  fails,  replacement 
is  normally  made  with  an  identical  operable  unit  or  the 
failed  unit  is  repaired  by  maintenance  actions  which  com- 
pletely restores  the  unit's  operational  capabilities.  This 
repair  or  replacement  of  failed  units  is  called  a renewal 
process  (13:97) . 

Since  renewal  is  a mathematical,  as  well  as  a physi- 
cal concept,  a more  quantitative  development  of  the  renewal 
concept  is  desirable.  For  example,  let  t^^  be  the  time  of 
the  first  failure  of  a given  unit;  at  which,  the  first  unit 
is  either  replaced  by  another  operable,  identical  unit  or 
completely  repaired.  In  reneval  theory,  the  downtime  for 
the  removal  and  replacement  process  and  any  shelf  life  in 
the  supply  system  of  a unit  is  usually  assumed  to  be  negli- 
gible. If  the  second  unit  (or  renewed  unit)  begins  oper- 

I 

ation  at  time  t^^  and  eventually  fails  at  t^^,  the  total 

/ 

operating  time  is  given  by  t^^  - t^  = t2  time  periods.  The 
procedure  continues  with  unit  three  (or  unit  one  renewed 
for  the  ser  ,'nd  time)  operating  t^  time  periods,  etc.  The 
system  operating  time  T^^  for  n - 1 renewals  is  given  by 


rp 

“n 


+ t. 


+ 


+ t 


n ' 
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th. 

where  T is  the  tine  of  the  failure  and  t is  the  tine 

n n 

from  the  (n-1)  failure  to  the  failure. 

Each  f (T  ) is  now  a density  function  for  n renewals 
n 

computed  from  the  individual  densitv  functions  f (t  ) . 

- n n 

This  procedure  is  called  a convolution.  The  convolution  of 
a set  of  individual  density  functions  implies  that  a random 
observation  is  extracted  from  each  of  the  individual  den- 
sity functions  and  then  summed.  These  s\imLmations  are  now 
random  observations  in  their  own  rights  of  the  convoluted 
density  function  T^^  (26  :350-9). 

For  example,  assume  that  a population  of  operable 
identical  units  exists  in  which  only  one  unit  is  needed  to 
satisfactorily  perform  a particular  m.ission.  If  unit  one 
is  initially  placed  in  operation  at  tim.e  0 and  fails  after 
15  hours,  the  total  operating  time  of  unit  one  is  t^  = 15 
hours.  If  unit  one  is  replaced  by  unit  two  and  subse- 
quently fails  after  operating  26  hours,  the  total  operating 
time  for  unit  two  is  t2  = 26  hours.  Unit  two  is  now 
replaced  by  unit  three  and  after  nine  hours,  unit  three 
fails  with  the  total  operating  time  for  unit  three  being 
t3  = 9.  If  the  example  is  stopped  at  this  point,  the  sys- 
tem operating  time  is 

= t,  + to  + t,  = (15  + 26  + 9)  hours  = 50  hours. 

Now  if  one  assurries  that  the  population  contains  m units  and 
that  the  operating  times  for  tj^,  t2,  and  t,  were  randomly 
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dra^vn  from  failure  cycles  1,  2,  and  3 respectively,  T can 

n 

be  conceptualized  as  a convolution. 

, In  a renewal  environment  the  exponential  distri- 

bution model  is  often  used  for  describing  failures  of  units 
because  the  model  is  characterized  by  a "complete  lack  of 
memory"  property, 

P{X  >r+s  I X>r}=  P{x  > s}. 


where  r and  s are  any  positive  numbers  (12:156). 

This  means  that  P{x>s]  is  independent  of  r.  In 
other  words,  if  a piece  of  equipment  has  not  failed 
during  r time  units  its  conditional  probability  of 
serving  r+s  or  more  time  units  is  independent  of  r and 
is  equal  to  the  probability  of  serving  s or  m.cre  time 
units.  Stated  differently,  if  time  to  failure  of  a 
piece  of  equipment  follows  the  exponential  distri- 
bution, _then  aging  of  the  equiom.ent  is  immaterial 
Ll2:157j. 


The  lack  of  memory  property  of  the  exponential  model 

greatly  encourages  the  distribution's  use  in  considering 

renewed  units.  If  a unit  is  assumed  to  have  a constant 

hazard  rate,  then  when  observed  at  any  time  t,  such  that, 

(t  > 0) , the  unit  will  theoretically  be  as  good  after 

renewal  as  the  unit  was  initially.  Therefore,  renewal  of 

an  exponentially  distributed  unit  implies  that  it  will 

th 

perform  the  sam.e  after  the  1000  ' renewal  as  it  did  ini- 
tially. 

For  exam.ple,  consider  the  system  of  100  operational 
units  explained  previously.  If  the  ten  units  which  failed 
in  the  first  time  period  0 to  t^^  are  completely  renewed 
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instantly,  then  100  units  are  again  operational  at  t^. 

This  constant  hazard  system  with  complete  renewal  would 
j then  predict  ten  units  failing  in  the  equal  time  period 

tj^  to  t2"  In  fact,  the  lack  of  memory  property  and  com- 
plete renewal  allows  one  to  assume  that  ten  units  would 
fail  in  any  equal  time  period  selected. 

Armed  with  such  reliability  concepts  as  the  hazard 
rate,  renewal,  and  underlying  failure  distributions, 
attention  is  shifted  to  the  basic  i for  performing  the 
research. 

Research  Plan 

Failure  data  for  three  Inertial  Measurement  Units 
(I!4U)  were  obtained  from  the  G078C  Data  Collection  System. 
This  data  collection  system  is  maintained  by  the  Aerospace 
Guidance  and  Metrology  Center  (AGMC)  at  Newark  AFS,  Ohio. 
These  raw  failure  data  were  used  to  extract  useable  failure 
data,  such  as  operating  times  and  failure  cycle  numbers  for 
the  IMU.  After  the  data  extraction  process,  a graphical 
curve  fitting  procedure  was  needed  to  aid  in  the  selection 
of  the  failure  data's  underlying  failure  distribution. 

Thus,  the  SIMFIT  computer  program,  a 'state  of  the  art' 
curve  fitting  technique,  was  utilized  in  the  comparison  of 
the  failure  data  with  theoretical  distributions. 

Two  very  important  reasons  for  establishing  an 
underlying  distribution  for  a gi*ren  set  of  failure  data  are; 
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1.  The  implication  that  future  failure  patterns 


will  be  identical  with  the  failure  patterns  observed  in  the 
past  (7  : 3-6) . 

2.  The  ability  to  use  statistical  techniques  and 
methods  based  on  known  distributions  in  describing  the 
reliability  of  a given  unit. 

The  SIMFIT  computer  program  performs  a curve  fit- 
ting evaluation  routine  by  performing  two  nonparametric 

goodness-of-f it  tests.  These  nonparametric  tests,  the 

2 

Kolmogorov-Smirnov  (K-S)  and  the  Chi-square  (x  ) Goodness- 

Of-Fit  (G-O-F)  tests,  are  used  due  to  their  distribution- 
4 

free  property.  This  property  and  the  availability  of  the 
2 

X-S  and  x test  results  motivated  the  use  of  SIMFIT  for 

2 

data  analysis.  The  results  of  the  K-S  and  x tests  were 

used  to  accept  or  reject  a particular  distribution  as  the 

underlying  distribution  for  a given  set  of  failure  data  at 

the  90%  confidence  level.  The  statistical  decision  rules 

were  as  follows; 

1.  Hypothesis  Statement (s) 

Null  X ~ the  hypothesized  distri- 

bution with  the  desired 
parameter  (s ) . 

Alternate  X / the  hypothesized  distri- 

bution with  the  desired 
parameter (s) . 


4 

A basic  description  of  nonparametric  statistics  is 
contained  in  Appendix  C. 
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2.  Rejection  Decision  Criteria  using  the  K-S  G-O-F 
Test  Results  from  SIMFIT: 

If  the  calculated  value  of  D is  greater  than 
the  critical  value  of  D,  reject  the  null 
hypothesis  at  the  specified  level  of  confi- 
dence . 

2 

3.  Rejection  Decision  Criteria  using  the  y G-O-F 
Test  Results  frora  SIMFIT: 

2 

If  the  calculated  value  of  y is  greater  than 
the  critical  value  of  y^,  reject  the  null 
hypothesis  at  the  specified  level  of  confi- 
dence . 

Since  the  SIMFIT  computer  program  employs  both  of  these 

G-O-F  tests,  a decision  was  necessary  in  the  event  that  the 

results  of  the  two  tests  were  not  the  same.  Therefore,  in 

this  research,  if  a distribution  passed  either  the  K-S  or 
2 

the  X G-O-F  Test  (or  both) , the  failure  data  distribution 
was  accepted  as  following  the  theoretical  distribution  with 
the  distribution  parameters  calculated  by  SIMFIT. 

In  order  to  determine  if  the  research  hypothesis 
was  supported,  the  SIMFIT  results  from  each  particular  IMU 
population  were  analyzed.  The  analysis  was  accomplished  by 
grouping  the  failure  data  by  failure  cycle.  The  best  model 
for  a particular  population  was  considered  to  be  the  theo- 
retical distribution  that  fit  the  largest  percentage  of 
failure  cycles.  If  the  best  model  was  not  the  exponential 
for  any  of  the  three  IMU  populations  tested,  the  research 
hypothesis  was  supported.  Conversely,  if  the  best  model 
for  each  population  was  the  exponential  distribution,  then 
the  research  hypothesis  was  not  supported. 
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Research  Assumptions 

The  following  assumptions  were  made  for  the 
research  effort: 

1.  The  sample  units  are  independent  of  each  other; 
therefore,  the  failure  of  one  unit  has  no  affect  on  the 
failure  of  any  other  unit  in  the  sample. 

2.  The  renewal  process  is  a function  of  the 
renewed/replaced  unit.  The  effects  of  any  improper  main- 
tenance actions  were  not  considered.  "Conceptually,  main- 
tenance malpractice  is  a random  variable  [8:32!." 

3.  SIMFIT  selected  the  best  parameters  for  a given 
distribution  using  the  optimal  number  of  cells  and  cell 
widths. 

4.  The  data  provided  by  the  G078C  Data  Collection 
System  was  not  contaminated. 

5.  The  populations  under  study  were  subjected  to  a 
constant  maintenance  policy. 

In  the  following  chapter,  the  procedures  necessary 
to  accomplish  the  SIMFIT  analysis  are  described  in  the 
sequence  in  which  they  were  actually  performed  during  the 
research  effort. 
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CHAPTER  III 


METHODOLOGY 

The  objective  of  the  research  was  to  compare  actual 
field  data  with  theoretical  distributions  to  determine  if 
the  exponential  model  was  indeed  the  appropriate  failure 
model.  To  accomplish  this  objective,  the  reliability  con- 
cepts discussed  in  Chapter  II  were  employed.  In  addition, 
a methodology  was  developed  to  collect  and  analyze  the 
failure  data.  This  chapter  explains  the  procedure  v/hich 
were  used  to  mieet  the  research  objective. 

The  population  of  units,  from  which  the  failure  data 
were  collected,  is  addressed  initially,  along  with  the  cri- 
teria used  for  sample  determination.  Next  the  data  col- 
lection procedure,  as  well  as,  a summ.ary  of  the  G078C  Data 
Collection  System  from  which  the  failure  data  was  extracted, 
is  presented.  Finally  the  techniques  of  analysis  which 
were  used  are  discussed. 

Population  and  Sample 

The  universe  of  the  research  effort  was  defined  to 
be  the  Inertial  Measurement  Units  repaired  by  the  Aerospace 
Guidance  and  Metrology  Center  (AGMC) . From  this  universe 
three  populations  were  identified  for  analysis.  The  nomen- 
clatures associated  with  the  populations  are:  the  FLIP 
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unit  used  on  the  C-5A  aircraft,  the  LN-15  unit  used  on  the 
E-52G/H  aircraft,  and  the  KT-73  unit  used  on  both  the  A-7D/E 
aircraft  and  the  AC-130K  aircraft  (29:5A).  For  clarity  and 
ccnsistency  the  ncmenclatures  FLIP,  KT-73,  and  LN-15,  used 
by  the  G078C  Data  Collection  System,  were  emiployed  through- 
out this  research.  All  available  failure  data  on  these 
three  populations  were  obtained  from  the  AGMC  for  analysis. 

The  FLIP,  LN-15,  and  KT-73  populations  appeared  to 
have  been  subjected  to  a constant  maintenance  pclicy  during 
the  data  collection  period.  This  policy  involved  the 
removal  and  replacem.ent  of  any  unit  at  the  time  of  failure. 
Such  maintenance  was  performed  at  the  flight  line  level. 

The  failed  units  were  then  sent  to  AGNC,  where  the  failures 
were  verified  and,  if  necessary,  the  units  were  repaired. 
Repair  consisted  of  unsealing  the  hermetically-sealed  unit, 
removing  and  replacing  (R  & R)  failed  sub-units,  resealing 
the  unit,  and  performing  an  operational  checkout  (18) . The 
unit  was  then  returned  to  the  supply  channel  (See 
Figure  3.1). 

The  assum.ption  of  a constant  maintenance  pclicy  was 
critical  to  the  research  because  if  the  maintenance  policy 
changes  during  the  collection  of  data,  the  observed  distri- 
bution may  also  change.  If  the  m.aintenance  policy  had 
charged,  the  research  assumption  of  a constant  maintenance 
policy  stated  in  Chapter  II  would  not  have  been  met. 

32 


i 


Figure  3.1.  Simplified  Repair  Flow  for  an  IM(i 


Each  individual  unit  cf  each  population  under  study 
had  an  elapsed  time  indicator  and  an  identification  serial 
number.  The  elapsed  time  indicator  was  necessary  to  insure  j 

the  length  of  time  that  the  components  were  awaiting  repair,  | 

or  were  in  the  supply  channels,  was  not  added  to  the  oper-  ! 

ating  time  of  the  units.  The  identification  serial  number 
was  required  to  identify  the  failure  cycles  associated  with 
any  particular  unit. 

Initially,  the  sample  used  for  analysis  was  desig- 
nated as  a failure  cycle  sample.  A failure  cycle  for  a . 
unit  was  defined  to  be  the  unit's  operating  time  from 
renewal  to  failure;  i.e.,  failure  cycle  one  was  the  time 
from  the  unit's  acquisition  until  its  first  failure,  failure 
cycle  two  was  the  time  from  the  unit's  renewal  after  its 
first  failure  until  its  second  failure,  etc.  (See 
Figure  3.2).  A failure  cycle  sample  was  subjectively 

determined  to  be  the  sample  of  failures  drawn  from  a given  i 

IMU  population  for  a particular  cycle.  All  failure  data  ' 

used  in  this  research  were  collected  from  the  time  of  each 
unit's  acquisition  to  1 February  1977.  Due  to  a modifi- 
cation of  the  FLIP  unit,  no  failure  data  for  that  unit  was 
used  that  was  collected  after  1 May  1976. 

, The  sample  size  was  established  to  be  equal  to  the 
number  of  failures  for  a particular  failure  cycle  from  the 
AGMC  failure  data.  The  sample  size  for  any  given  failure 
cycle  varied  with  the  population  under  study.  Several  of 
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the  failure  cycles  had  as  few  as  ten  failures.  Any  result 


obtained  from  such  a small  sample  would  be  questionable; 
therefore,  for  the  purpose  of  this  research,  all  failure 
cycles  which  contained  data  on  less  than  forty-eight  fail- 
ures were  discarded  (See  Table  3.1). 

Data  Collection 

The  failure  data  for  the  three  IMUs  v^ere  collected 
at  the  AGMC  in  Newark  AFS,  Ohio.  This  location  is  the 
military's  only  repair  facility  for  these  units.  Failure 
data  were  obtained  from  the  G078C  Data  Collection  System 
which  is  normally  used  by  reliability  engineers  to  isolate 
and  resolve  identified  reliability  problems  (18;29). 

The  G078C  Data  Collection  System  required  the 
recording  of  failure  data  on  a five-card  type  format.  Only 
card  type  one  contained  data  useful  in  this  research,  such 
as  operating  hours,  number  of  the  failure  cycles  for  each 
unit,  etc.;  therefore,  only  card  one  data  was  obtained  and 
analyzed. 

The  data  were  sorted  by  population  type  and  failure 
cycle.  The  hours  of  operation  of  each  unit  within  each 
population  were  extracted  from  the  data  cards  for  every 
failure  cycle.  These  operating  times  were  then  placed  into 
data  files  by  population  and  failure  cycle.  These  data 
files  were  called  failure  cycle  samples  and  constituted  the 
data  base  entered  into  SIMFIT  for  the  analysis.  Individual 

failure  cycle  operating  times  are  included  in  .Appendix  E. 
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Table  3.1 


Sunorary  of  Sarrcle  Sizes 


Failure 

Cycle 

Number 
of  FLIP 
Failures 

Number 
of  KT-73 
Failures 

Number 
of  LM-15 
Failures 

1 

145 

554 

316 

2 

136 

451 

213 

J 

127 

354 

117 

4 

124 

258 

49 

5 

118 

185 

26 

6 

110 

120 

18 

7 

96 

42 

7 

8 

82 

23 

4 

9 

62 

11 

2 

10 

49 

8 

0 

11 

27 

1 

0 

12 

11 

0 

0 

Note;  All  samples  with  less  than  forty-eight  data 
points  were  discarded. 


37 


■ 





The  validity  of  SIMFIT's  test  is  dependent  upon  the 
accuracy  of  the  data  collected  by  AGMC  via  the  G078C  Data 
Collection  System.  Data  validity  was  a research  goal;  but, 
realistically,  the  possibility  existed  that  the  data  could 
have  been  contaminated  for  one  unit  or  one  cycle.  After  an 
extensive  search  for  accurate  and  useable  data,  the  decision 
w’as  made  to  use  the  G078C  data  simply  because  no  better, 
nonaggregated  data  were  available. 

Technique  of  Analysis 

Best  model  criteria.  The  failure  cycle  sample 
operating  times  were  entered  into  the  SIMFIT  com.puter 
program  for  each  population  and  oompared  to  the  theoretical 
distributions  that  were  of  interest  to  this  research  effort; 
nam.ely,  the  exponential.  Erlang,  VJeibull,  gamma,  Pearson  XI, 
lognormal,  normal,  beta,  and  negative  binomial.  The  sug- 
gested values  calculated  by  the  SIMFIT  program  v;ere  used  to 
establish  the  number  of  cells  and  the  width  of  each  cell. 

All  distributions  were  tested  at  the  90%  confidence  level 
because  "current  military  specifications  in  contracts 
dealing  with  reliability  append  a 90%  confidence  factor  to 
the  specified  reliability  measure  . . . ” 4 : 5] . " 

One  of  the  outputs  from  the  SIMFIT  program  was  a 
histogram  of  the  data  that  were  entered.  The  histogramis 
obtained  for  the  failure  cycle  samples  are  contained  in 
Appendix  F. 
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other  SIMFIT  outputs  included  the  results  of  the 
K-S  and  x**  G-O-F  tests.  If  one  of  these  G-O-F  tests 
"passes"  a given  theoretical  distribution,  then  the  data 
sample  could  have  been  drawn  from  that  distribution.  Con- 
versely, if  both  of  these  G-O-F  tests  "fail"  a given  theo- 
retical distribution,  then  the  data  sample  could  not  have 
been  drawn  from  that  distribution.  Of  course,  G-O-F  tests 
are  dependent  upon  the  confidence  level  desired  in  the  pass/ 
fail  determination.  A 90%  confidence  level  was  used  in 
this  research.  These  test  results  v/ere  tabularized  in  order 
to  determine  the  best  model  for  each  populauion. 

Data  aggregation.  Another  approach  to  analyzing 
the  failure  data  of  each  population  was  to  aggregate  all 
failure  cycle  samples  of  a given  population  into  one  data 
file  of  operating  times.  In  essence,  there  was  one  aggre- 
gation of  failure  data  for  each  population.  The  data 
aggregation  for  each  population  was  then  tested  using 
SIMFIT  to  determine  any  underlying  distribution. 

Convoluted  sampling.  An  alternate  approach  to 
analyzing  the  appropriateness  of  the  exponential  assumption 
was  the  method  of  convolution  of  data.  The  convolution 
approach  was  used  to  make  the  analysis  more  complete  and  to 
possibly  add  support  to  the  results  of  the  previous  analy- 
sis. The  disadvantage  of  this  approach  was  that  the  expo- 
nential assumption  can  only  be  disproven  by  this  m^ethod. 
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In  other  words,  this  was  a negative  test  which  could  only 
show  that  the  exponential  assumption  was  inappropriate. 

The  convolution  was  accomplished  by  random  sampling 
with  replacement  from  each  failure  cycle  sample  of  the 
specified  population.  The  operating  times  drawn  from  each 
failure  cycle  were  summed  and  the  summation  was  placed  into 
a data  file  to  be  used  by  SIMFIT.  Each  of  these  data  files 
was  called  a convoluted  sample.  See  Figure  3.3  for  a pic- 
torial representation  of  convolution.  To  remain  consistent 
with  previous  analysis  of  data,  all  convoluted  sam.ples  con- 
tained more  than  forty-eight  data  points . 

Reliability  comparison.  If  the  underlying  distri- 
bution for  a data  sample  and/or  population  can  be  discovered, 
then  other  analysis  based  on  the  param.etric  statistics  of 
that  distribution  may  be  performed  (22).  One  such  method  of 
analysis  v;as  made  by  comparing  the  parameters  calculated  by 
the  SIMFIT  program,  for  the  actual  failure  distribution  with 
the  param.eters  specified  by  the  contract  at  the  time  of 
acquisition.  Difficulty  was  encountered  in  researching  the 
contract  reliability  specification  for  the  FLIP  unit;  there- 
fore, no  reliability  comparison  was  made  between  the  actual 
and  specified  reliability  for  this  unit. 

' The  reliability  R(t)  and  hazard  rate  H(t)  were 
graphically  compared  for  certain  failure  cycles.  The  pur- 
pose was  to  determine  if  there  were  any  significant 
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Figure  3.3.  A Process  of  Convolution 
for  IxMU  Failure  Data 
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differences  between  the  reliability  of  the  actual  failure 
distributions  and  the  contract  specified  reliability  which 
i assumed  the  exponential  distribution. 

Once  the  research  methodology  was  formulated,  the 
data  from  the  G078C  Data  Collection  Systeiu  were  entered 
into  SIKFIT  under  the  conditions  specified  in  this  chapter. 
Specific  analysis  of  the  SIKFIT  outputs  is  presented  in  the 
following  chapter. 


CHAPTER  IV 


ANALYSIS  OF  RESULTS 

This  chapter  presents  an  interpretation  of  the  out- 
puts of  the  SIMFIT  computer  program  using  the  methodology 
described  in  Chapter  III.  First,  the  results  of  the  SIMFIT 
computer  program  are  summarized  in  tables  to  facilitate  the 
comparison  of  the  exponential  distribution  with  the  other 
distributions  tested  via  SIMFIT.  The  purpose  was  to  deter- 
m^ine  if  the  exponential  distribution  was  the  most  appropri- 
ate failure  mcdel  for  the  populations  of  Inertial  Measure- 
m.ent  Units  considered.  Next,  data  aggregation  test  results 
are  presented  and  explained,  follov?ed  by  a similar  sum.r.ary 
of  the  SIMFIT  output  obtained  for  the  convoluted  sam.ples. 
Finally,  a comparison  is  m.ade  of  the  contract  specified 
reliability  and  the  achieved  reliability  based  on  param- 
eters calculated  by  SIMFIT  from  the  LN-15  and  Y.1-12  failure 
data. 

Certain  unexpected  results  that  were  encountered 
during  the  SIMFIT  output  analysis  prompted  research  beyond 
that  which  was  specified  in  the  original  methodology.  This 
chapter  concludes  with  an  analysis  of  these  anomalies  which 
included  a relatively  high  infant  mortality  and  a shift  in 
failure  distributions  from  cycle  to  cycle. 
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Analysis 


Best  model  determination.  The  best  model  for  each 
unit  was  defined  to  be  the  distribution  which  was  able  to 
fit  the  largest  number  of  failure  cycle  samples  for  that 
unit.  The  results  of  the  SIMFIT  data  analysis  are  summa- 
rized for  the  KT-73  unit  in  Table  4.1.^  The  best  model  for 
the  KT-73  unit  was  the  gamm.a  distribution;  hence,  for  the 
KT-73  unit,  the  research  hypothesis  is  supported.  The 
exponential  model  did  not  pass  any  distribution  of  failure 
cycle  data. 

Table  4.2  summarizes  the  results  of  the  SIMFIT 
analysis  for  the  FLIP  unit.  The  FLIP  unit  failure  cycle 
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samples  were  able  to  fit  the  gamma  distribution  for  nine  of 
the  ten  failure  cycles  tested.  The  high  percentage  of  fail- 
ure cycles  that  passed  the  gamma  distribution  indicated 
that  it  also  was  the  best  model  for  describing  the  pattern 
of  failures  for  the  FLIP  unit.  Therefore,  for  the  FLIP  IMU, 


the  research  hypothesis  was  supported.  The  exponential 
distribution,  on  the  other  hand,  passed  five  of  the  ten 
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Mote:  This  table  includes  the  parameters  calculated  by  STMFIT  for  the 

distributions  v/hich  passed  a G-O-F  test. 


The  SIMFIT  output  for  the  LN-15  IMU  is  summarized 
in  Table  4.3.  The  LN-15  unit  failure  cycle  samples  were 
modeled  by  the  exponential  distribution  for  only  one  of  the 
four  cycles  considered.  Since  the  gamma,  Weibull,  Erlang, 
and  Pearson  XI  passed  two  of  the  four  cycles,  all  four  of 
these  distributions  were  considered  better  models  for  the 
LN-15  unit  than  the  exponential  distribution.  The  analysis 
of  the  LN-15  failure  data  lends  support  to  the  research 
hypothesis . 

To  help  visualize  the  goodness-cf-f it  for  the  fail- 
ure cycle  samples  to  the  theoretical  distributions. 

Tables  4.4,  4.5,  and  4.6  were  constructed.  They  provide  a 
list  of  relative  rankings  of  the  theoretical  models  based 
on  the  "pass"  or  "fail"  criteria  established  in  Chapter  II. 
The  exponential  distribution  was  not  found  to  be  the  "best" 
model  for  any  unit,  while  the  gamma  proved  to  be  the  best 
model  for  the  KT-73  and  FLIP  units.  Therefore,  in  summary, 
the  research  hypothesis  was  supported  for  all  three  units. 

The  ranked  comparison  of  the  best  model  for  all 
failure  cycle  samples  from  all  three  units  is  presented  in 
Table  4.7.  This  table  shows  that  the  gamma  distribution 
provided  the  best  overall  model  for  the  units  tested. 
Fourteen  of  the  twenty  failure  cycle  samples  tested  fit  the 
gamma  distribution.  The  exponential  model  passed  only  six 
of  the  tv/enty  failure  cycle  samples  tested  for  G-O-F  by  the 
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Mote;  This  table  includes  the  parameters  calculated  by  STMFIT  for  the 
distributions  which  passed  a C-O-F  tost. 


Table  4.4 


Ranked  Comparison  of  the  Best  Model  for  the  KT-73  Unit 


Distribution 

Number  Passed 

Total  Number 

% 

Passed 

Gamma 

3 

6 

50 

Lognormal 

2 

6 

33 

Weibull 

1 

6 

17 

Erlang 

1 

6 

17 

Beta 

1 

6 

17 

Exponential 

0 

6 

0 

Pearson  XI 

0 

6 

0 

Negative 

Binomial 

0 

6 

0 

Nomal 

0 

6 

0 

Table  4.5 


Ranked  Comparison  of  the  Best  Model  for  the  FLIP  Unit 


Distribution 

Number  Passed 

Total  Number 

% 

Passed 

Gamma 

9 

10 

90 

Weibull 

7 

10 

70 

Lognormal 

7 

10 

70 

Exponential 

5 

10 

50 

Erlang 

5 

10 

50 

Pearson  XI 

5 

10 

50 

Negative 

Binomial 

5 

10 

50 

Beta 

0 

10 

0 

Normal 

0 

10 

0 

i 
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Table  4.6 


Ranked  Comparison  of  the  Best  Model  for  the  LN-15  Unit 


Distribution 

Number  Passed 

Total  Number 

% 

Passed 

Gamma 

2 

4 

50 

Weibull 

2 

4 

50 

Erlang 

2 

4 

50 

Pearson  XI 

2 

4 

50 

Lognormal 

2 

4 

50 

Exponential 

1 

4 

25 

Negative 

Binomial 

1 

4 

25 

Beta 

1 

4 

25 

Normal 

0 

4 

0 

52 


4*  t 


Table  4.7 


Ranked 

Comparison  of  the 

Best  Model  for  All  Units 

Distribution 

Number  Passed 

Total  Number  % 

Passed 

Gamma 

14 

20 

70 

Lognormal 

11 

20 

55 

Weibull 

10 

23 

50 

Erlang 

8 

20 

40 

Pearson  XI 

7 

20 

35 

Exponential 

6 

20 

30 

Negative 

Binomial 

6 

20 

30 

Beta 

2 

20 

10 

Normal 

0 

20 

0 

SI.'-!FIT  program;  therefore,  the  exponential  distribution  was 
not  the  best  model  for  the  failure  cycles  of  the  three  IMUs. 

Data  aggregation  results.  After  testing  failure 
cycles  for  underlying  distributions,  the  SIMFIT  results 
obtained  from  the  aggregated  data  samples  were  analyzed. 

The  aggregated  data  from  each  of  the  three  IMU  populations 
did  not  fit  any  of  the  theoretical  distributions  tested  by 
SII-IFIT.  This  result  was  somev/hat  surprising  because  indi- 
vidual failure  cycle  samples  were  able  to  fit  at  least  one 
of  the  distributions  tested  for  sixteen  of  the  twenty  indi- 
vidual cycle  samples. 

Currently,  the  Air  Force  aggregates  failure  data  in 
m.ost  data  collection  systems.  The  assumption  that  failures 
follow  an  exponential  distribution,  which  was  solidified 
years  ago  by  military  standards  and  regulations  that  sup- 
ported the  "exponential  assumption,"  is  the  primary  reason 
that  data  collection  systems  currently  aggregate  data.  The 
SIMFIT  results  obtained  for  the  aagregated  IMU  failure  data 
would  seem  to  indicate  that  a data  collection  system,  based 
on  the  aggregation  of  failure  data,  is  nearly  useless  in 
trying  to  discover  (or  even  verify  as  for  the  exponential) 
an  underlying  distribution  for  the  failure  data.  If  this 
underlying  distribution  is  not  obtainable,  then  realistic 
prediction  of  operating  characteristics  of  components/ 
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systems  and  of  logistical  support  requirements  cannot  be 
made . 

Convoluted  sampling  results.  In  continuing  the 
analysis  of  the  IMU  failure  data,  SIMFIT  was  used  to  test 
randomly  drawn  convoluted  samples  from  each  of  the  three 
populations.  Although  this  analytical  procedure  would 
result  in  only  a negative-type  of  test;  i.e.,  the  exponen- 
tial assum.ption  could  only  be  disproved  (as  discussed  in 
Chapter  III) , its  inclusion  was  deemed  necessary  as  an 
alternate  source  of  possible  support  for  the  research 
hypothesis . 

The  convoluted  data  for  the  KT-73  unit  failed  the 
SIMFIT  criteria  of  passing  either  or  both  G-O-F  tests  for 
all  theoretical  tested  except  for  the  normal  distribution. 
Therefore,  since  the  convoluted  data  were  not  able  to  fit 
the  Erlang  distribution,  the  underlying  failure  model  could 
not  have  been  the  exponential  distribution.  This  finding 
supported  the  research  hypothesis. 

The  convoluted  data  for  both  the  FLIP  unit  and  the 
LN-15  unit  passed  the  Erlang  distribution.  The  fact  that 
the  convoluted  data  were  able  to  fit  the  Erlang  model  does 
not,  in  itself,  substantiate  the  assumption  that  the  under- 
lying' distribution  was  exponential  for  the  individual  fail- 
ure cycles  of  these  two  units;  therefore,  this  outcome  does 
not  contradict  earlier  support  for  the  research  hypothesis. 
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The  convolution  data  samples  used  for  this  analysis  are 
contained  in  Appendix  E and  their  SIMFIT  histograms  are 
contained  in  Appendix  F. 

Reliability  comparison  results.  The  final  phase  of 
the  planned  analysis  entailed  reliability  comparisons 
between  contract  specified  reliability  and  achieved  reli- 
ability computed  for  the  distribution  parameters  provided 
by  SIMFIT.  One  extremely  important  reason  for  accurately 
identifying  the  underlying  distribution  of  a sample  of 
failure  data  is  to  m.ake  possible  the  use  of  established 
param.etric  statistical  methods  and  techniques  to  further 
characterize  the  sample  data. 

The  sample  parameters  for  each  of  the  failure  cycles 

were  obtained  from  SIMFIT.  The  means,  standard  deviations, 

variances,  and  the  number  of  failures  observed  for  each 

2 

cycle  are  tabularized  in  Table  4.8.  After  observing  the 
means  in  this  table,  the  mean  time  between  failures 
appeared  to  be  much  less  for  all  three  units  after  the  first 
failure  than  the  mean  time  to  failure  of  the  first  cycle. 

For  instance,  the  MTBF  given  in  the  contract  for 
the  LN-15  unit  was  600  hours  (18);  however,  the  value  of 


An  interpretation  of  these  statistics  must  be  m.ade 
with  the  cognizance  that  som.e  units  may  still  be  operating 
as  of  the  data  collection  cutoff  date  while  others  may  have 
been  discarded  for  repeated  failures  or  may  be  somewhere 
within  the  supply  channel. 
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Table  4.8 


Table  of  Sample  Parameters 


Unit 

Mean 

Standard 

Deviation 

Variance 

# of  Failures 

KT-73 

Cycle  1 

818.06 

539.41 

290963.15 

554 

Cycle  2 

258.33 

306.92 

94199.89 

451 

Cycle  3 

226.85 

286.88 

82300.13 

354 

Cycle  4 

165.83 

216.34 

46803.00 

258 

Cycle  5 

133.27 

256.56 

65823.03 

185 

Cycle  6 

135.83 

209.98 

44091.60 

120 

FLIP 

Cycle  1 

900.95 

709.90 

503958.01 

145 

Cycle  2 

283.90 

490.56 

240649.11 

136 

Cycle  3 

300.43 

314.15 

98690.22 

127 

Cycle  4 

233.31 

222.14 

49346.18 

124 

Cycle  5 

258.84 

264.94 

70193.20 

118 

Cycle  6 

238.61 

283.67 

80468.67 

110 

Cycle  7 

.^28.34 

321.13 

103124.48 

96 

Cycle  8 

236.32 

262.63 

69000.78 

82 

Cycle  9 

164.31 

209.54 

43907.01 

62 

Cycle  10 

213.90 

249.08 

62040.85 

49 

LN-15 

Cycle  1 

635.07 

369.17 

136286.49 

316 

Cycle  2 

236.54 

213.92 

45761.77 

213 

Cycle  3 

119.01 

140.02 

19605.60 

117 

Cycle  4 

72.18 

167.58 

28083.06 

49 

the  MTBF  for  all  cycles  after  the  first  cycle  was  less  than 
the  600  hours  specified.  After  renewal  the  KT-73  unit,  with 
a contract  specified  MTBF  of  650  hours  (33) , also  had  a 
decreased  MTBF.  Therefore,  the  assumption  of  complete 
renev/al  apparently  was  not  valid  for  these  units. 

After  renewal,  the  actual  reliability  obtained  for 
all  cycles  was  less  than  the  specified  reliability. 

Figure  4.1  shows  the  reliability  plotted  for  parameters 
specified  in  the  contract  for  the  LN-15.  Figure  4.2  shows 
the  reliability  plotted  for  the  param.eters  obtained  from 
SIMFIT  for  the  second  failure  cycle  using  the  exponential 
distribution.  From  these  graphs  it  can  be  observed  that 
the  actual  reliability  was  considerably  less  than  that  given 
by  the  contract  specification. 

For  example,  the  probability  of  a LN-15  unit  oper- 
ating 600  hours  using  the  contract  specified  reliability 
is  37  percent.  The  probability  of  the  unit  actually  oper- 
ating 600  hours  after  the  first  renewal  is  only  7 percent. 
These  points  are  represented  by  the  dotted  lines  on 
Figures  4.1  and  4.2.  The  same  general  pattern  was  found 
present  for  the  KT-73  unit.  Both  the  KT-73  and  LN-15  units 
exceeded  the  contractor  specified  reliability  for  the  first 
failure  cycle,  but  after  renewal  the  reliability  dropped 
below  these  specified  levels. 

Although  no  contract  specified  reliability  was 
obtained  for  the  FLIP  unit,  a decrease  in  reliability  was 
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evidant  after  renewal.  Figures  4.3,  4.4,  and  4.5  of  the 
Weibull  model  and  Figures  4.6  and  4.7  of  the  exponential 
model  were  used  to  graphically  display  this  decrease  in 
reliability.  The  model  selected,  in  each  case,  had  the 
smallest  maximum  error  for  the  K-S  G-O-F  test  calculated 
by  SII4FIT  for  each  individual  failure  cycle. 

The  hazard  rates  obtained  for  the  units  were  also 
graphed  to  emphasize  that  the  actual  hazard  rate  increased 
after  renewal  and  to  show  that  the  hazard  rates  calculated 
from  the  SIMFIT  outputs  were  not  constant  from  cycle  to 
cycle.  The  graphs  of  the  hazard  rates  of  the  units  show  an 
increasing  hazard  rate  for  the  first  failure  cycle  and  a 
decreasing  hazard  rate  for  units  after  renewal. 

For  example.  Figure  4.8  shows  an  increasing  hazard 
rate  for  the  first  cycle  of  the  FLIP  unit  using  the  Weibull 
model.  The  Weibull  model  had  the  smallest  maximum  error  in 
the  K-S  G-O-F  test  for  this  cycle,  and  therefore,  was 
chosen  to  graph  the  hazard  rate.  Figure  4.9  shows  a 
decreasing  hazard  rate  for  cycle  three  of  the  FLIP  unit 
using  the  Weibull  model.  These  hazard  rate  graphs  clearly 
show  that  the  hazard  rate  was  higher  for  cycle  three  fail- 
ure data  than  for  cycle  one.  The  same  general  pattern  was 


present  for  the  KT-73  and  LN-15  units. 


Further  Analysis 


During  many  research  efforts,  unexpected  findings 
are  encountered.  At  times  these  unexpected  results  are 
more  interesting  and  valuable  than  the  original  research 
objective.  Therefore,  the  unexpected  results  are  presented 
without  full  knowledge  of  their  significance  and  with  the 
hope  that  these  findings  will  aid  future  research  efforts. 


Unexpected  results.  The  failure  data  for  each 
failure  cycle  of  the  FLIP  unit  (e.xcept  cycle  tv;o)  and  the 
LM-15  unit  passed  at  least  one  distribution  when  compared 
with  the  distributions  checked.  The  failure  data  on  the 
KT-73  unit  for  cycles  two,  three,  and  four,  however,  did 
not  pass  any  of  the  distributions  tested.  These  three 
failure  cycles  were  further  analyzed.  The  SIMFIT  histogram 
cells  sizes  were  set  equal  and  the  distributions  truncated 
by  'cutting  off  the  cells  which  had  fewer  than  two  entries 
on  the  histogram.  Figure  4.10  resulted  from  this  procedure. 
This  histogram  is  for  failure  cycle  three  but  similar  his- 
tograms were  obtained  from  cycles  two  and  four.  The  peaks 
and  valleys  of  the  histogram  explain  why  these  distribution 
would  not  fit  any  of  the  theoretical  distributions  tested. 
SIMFIT  contains  no  theoretical  distribution  with  these 
cyclic  peaks  and  valleys;  therefore,  further  analysis  was 
not  feasible  in  this  research. 


Infant  ir.ortality.  A second  interesting  finding  v/as 
a large  infant  mortality.  A reviev;  of  the  SI!'.FIT  histo- 
grams in  Appendix  F indicated  that  a large  number  of  early 
failures  (infant  mortality)  had  occurred  for  renewed  units. 
The  G078C  Data  Collection  System  defines  "zero  timers"  for 
these  three  units  as  a failure  that  occurs  within  the  first 
15  hours  of  operation.  This  value  applies  to  either  new 
units  or  renewed  units  (29).  Therefore,  failures  which 
occurred  v/ithin  15  or  less  hours  after  renev;al  were  con- 
sidered as  cases  of  infant  mortality.  These  were  enumer- 
ated and  compared  to  the  total  number  of  failures  for  each 
failure  cycle.  The  results  are  summarized  in  Table  4.9. 
Each  unit  had  a large  infant  mortality  after  renewal 
which  resulted  in  the  large  peak  in  cell  one  of  the  histo- 
grams for  ail  cycles  after  the  initial  cycle.  The  large 
percentage  of  infant  m.ortality  suggests  that  the  exponen- 
tial distribution  should  not  be  used  to  model  the  failures 
of  renewed  IMUs  (3:70-5).  Infant  m.ortality  is  norm.ally 
associated  with  electronic  components  and  therefore  'was  not 
expected  for  these  units. 

Distribution  shift.  After  reviewing  Tables  4.1, 
4.2,  and  4.3,  it  was  clear  that  renewed  units  did  not 
necessarily  follow  the  same  distribution  from  cycle  to 
cycle.  The  LN-15  IMU,  for  example,  passed  the  Erlang, 
Weibull,  and  ganm.a  for  cycles  one  and  tv;o,  but  passed  the 
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Table  4 . 9 


Infant  Mortality 


Unit 

Unit 

Cycle 

Number 

of 

Failures 

Number 

of 

Failures 
In  Less 
15  Hours 

% of 

Failures 
in  Less 
than  15 
Hours 

KT-7  3 

1 

554 

1 

.18 

2 

451 

66 

14.60 

3 

354 

68 

19.20 

4 

258 

53 

20.50 

5 

185 

37 

20.00 

6 

120 

42 

35.00 

FLIP 

1 

145 

1 

.69 

2 

136 

22 

16.2 

3 

127 

1.0 

7.87 

4 

124 

14 

11.29 

5 

118 

14 

11.86 

6 

110 

19 

17.27 

7 

96 

19 

19.70 

8 

82 

7 

8.54 

9 

62 

8 

12.90 

10 

49 

8 

16.33 

LN-15 

1 

316 

0 

0 

2 

213 

17 

7.98 

3 

117 

28 

23.93 

4 

49 

19 

38.76 

Pearson  XI  and  the  lognormal  distribution  for  cycle  three, 
and  subsequently  passed  only  the  Pearson  XI  on  cycle  four 
failures.  This  "distribution  shift"  v/as  also  apparent  in 
the  KT-73  and  FLIP  inertial  measurement  units. 

Two  possible  reasons  for  the  change  in  the  distri- 
butions from  cycle  to  cycle  of  the  same  population  are: 

1.  The  units  with  long  operating  lives  have  not 
yet  failed  and  when  these  units  fail  the  distribution  will 
shift  back  to  the  same  distributions  as  found  in  the 
earlier  cycles. 

2.  Due  to  the  aging  process  different  sub-units  are 
failing  in  the  later  cycles  changing  rhe  underlying  distri- 
butions describing  the  pattern  of  failures. 

If  this  "distribution  shift"  is  present  in  many 
complex  systems  which  are  renewed,  then  knowledge  of  the 
time  and  pattern  of  these  shifts  would  be  a powerful 
planning  tool. 

Sumr.ary 

The  analysis  of  the  SIMFIT  results  indicated  that 
the  exponential  distribution  was  not  the  best  model  for  the 
underlying  distribution  of  the  failure  data  from  the  three 
IMU  populations  tested.  The  gamma  distribution  proved  to 
be  the  "best"  model  for  these  units.  Since  other  distri- 
butions were  better  m.cdels  for  the  IMU  failure  data  than 

73 


the  exponential  distribution,  the  research  hypothesis  was 
supported. 

The  data  aggregation  analysis  indicated  that  valu- 
able information  was  lost  by  the  aggregation  process. 

While  the  individual  failure  cycle  samples  passed  certain 
distributions,  this  information  about  the  failure  models 
was  lost  v/hen  the  data  were  aggregated. 

The  analysis  of  the  convolution  sam.ples  were  some- 
what inconclusive.  The  KT-73  data  did  not  fit  the  Erlang 
distribution  when  convoluted;  therefore,  the  exponential 
distribution  was  not  an  appropriate  model  to  describe  fail- 
ures for  that  population.  Inasmuch  as  the  data  for  the 
other  two  IMUs  (LN-15  and  FLIP)  passed  the  Erlang  distri- 
bution; the  exponential  distribution  could  not  be  ruled  out 
as  being  the  possible  underlying  distribution  of  the  failure 
data  based  only  on  the  results  obtained  from  the  convolution 
analysis . 

After  renewal,  the  reliability  of  the  units  was  less 
than  that  which  was  specified  in  the  contracts  for  the  LN-15 
and  KT-73  IMUs.  The  assumption  of  complete  renewal  with  the 
exponential  m.odel  being  the  underlying  distribution  of  fail- 
ures after  renewal  does  not  appear  correct,  because  the 
"exponential  assumption"  indicates  that  there  is  no  decrease 
in  the  reliability  of  a specific  unit  even  after  100 
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renewals . 


Several  unexpected  results  surfaced  during  the 
analysis  of  the  failure  data.  The  high  infant  mortality, 
after  renewal,  indicated  that  the  exponential  model  was 
inappropriate  for  these  units.  An  additional  fact  was  tha 
the  failure  distributions  were  not  constant  from  cycle  to 
cycle.  No  generalization  was  possible  as  to  the  exact 
reason  for  this  "distribution  shift"  between  cycles.  This 
shift  could  be  peculiar  to  the  units  tested  or  could  be  a 
general  failure  pattern  for  many  types  of  units. 
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CHAPTER  V 

CONCLUSIONS  AND  RECOMMENDATIONS 

This  chapter  discusses  the  conclusions  derived  from 
the  analysis  of  failure  data  for  the  three  Inertial  Measure- 
ment Units.  First,  the  research  hypothesis  is  addressed. 
Then,  additional  conclusions  are  drawn  based  on  the  anal- 
ysis of  the  failure  data.  The  chapter  ends  with  a presen- 
tation of  recommended  areas  for  future  research. 

Support  for  the  Research  Hypothesis 

The  research  hypothesis  was  stated  as:  The  distri- 

bution of  failure  for  some  replaceable/repairable  units 
follov;  a statistical  distribution  which  differs  from  the 
exponential  distribution  model.  All  three  populations  of 
IMU  failure  data  that  were  analyzed  provided  support  to 
this  research  hypothesis.  For  the  FLIP  and  LN-15  units  the 
gamma  proved  to  be  the  best  fit,  and  for  the  KT-73  unit 
four  failure  distributions  proved  to  be  better  fits  than 
the  exponential  model.  These  results  supported  the 
research  hypothesis.  In  addition,  the  convoluted  data  for 
the  KT-73  unit  did  not  fit  the  Erlang  distribution  which 
indicated  that  the  exponential  model  was  inappropriate. 

Once' again  the  research  hypothesis  was  supported. 

The  gamma  distribution  was  able  to  fit  a total  of 
fourteen  of  the  sample  failure  cycles  out  of  the  twenty 
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tested.  This  result  suggested  that  the  gamma  model  may  be 
a better  failure  model  for  all  Inertial  Measurement  Units 
than  the  assumed  exponential  model.  Of  course,  this 
result  should  be  verified  by  additional  analysis  of  other 
IMUs.  If  this  conclusion  is  verified,  then  the  Air  Force 
should  consider  the  use  of  the  gamma  model  when  contracting 
for  additional  irTOs. 

Although  the  gamma  distribution  was  able  to  model 
many  of  the  individual  failure  cycles,  when  data  was  aggre- 
gated, the  gamma  distribution  did  not  fit  the  data.  This 
data  aggregation  provided  expected,  though  disappointing, 
results.  None  of  the  three  populations  passed  any  distri- 
bution tested  after  the  failure  data  were  aggregated.  This 
conclusion  supported  the  argument  against  aggregation  of 
failure  data. 

Currently  most  data  collection  systems  within  the 
Air  Force  aggregate  data.  This  data  is  then  used  to  calcu- 
late the  MTBF  by  the  formula, 

total  ooeratinq  hours 

MTBF  = . 

total  number  of  failures 

This  equation  is  valid  only  when  using  the  exponential 
model,  because  after  data  have  been  aggregated,  the  total 
operating  hours  and  the  total  number  of  failures  are  the 
only  information  which  is  available.  Therefore,  the 
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exponential  model  must  be  assumed  because  other  possible 
models  require  more  information  than  is  provided  by  aggre- 
gated data. 

Renewal 

The  analysis  of  the  failure  data  showed  that  the 
concept  of  complete  renewal  v/as  not  valid.  There  was  a 
significant  drop  in  operating  times  between  failures  in  the 
first  cycle  and  failures  in  the  second  cycle.  This  large 
drop  could  have  been  caused  by  two  primary  factors. 

1.  The  units  were  operated  and  repaired  by  the 
contractor  before  the  Air  Force  gained  repair  responsibility 
for  the  units. 

2.  The  elapsed  time  indicators  were  not  set  to  zero 
when  the  Air  Force  acquired  the  units.  For  example,  a type 
of  "burn  in"  could  have  been  employed  by  the  contractor. 

The  complete  renewal  concept  was  also  shown  to  be 
inappropriate  because  of  the  high  infant  mortality  for 
renewed  units.  For  example,  failure  cycle  four  of  the 
LN-15  unit  had  a 39%  infant  mortality  rate.  This  high 
infant  mortality  could  have  been  caused  by  design  imper- 
fections in  each  of  the  three  units  or  by  improper  or  incom- 
plete maintenance  and/or  checkout  of  the  units.  This 
research  could  not  determine  the  cause  for  the  infant 
mortality,  but  clearly  the  concept  of  complete  renev/al  was 
inappropriate  for  these  IMUs. 


Recommendations 


In  any  research  effort  many  interesting  and  per- 
plexing questions  are  encountered  which  cannot  be  completely 
resolved.  Besides  the  required  amount  of  time  being  pro- 
hibitive, the  research  process  itself  continually  regener- 
ates new  and  just  as  perplexing  questions.  Therefore,  many 
questions  arising  from  a research  effort  must  be  left  for 
future  research.  The  following  list,  in  varying  detail, 
are  areas  relating  to  this  thesis  that  are  recommended  for 
future  research. 

1.  One  area  of  recommended  research,  which  m.ight 
provide  the  most  significant  impact  in  relation  to  this 
thesis,  would  be  an  analysis  between  the  cost  actually 
incurred  by  assuming  the  exponential  failure  model  and  the 
cost  that  would  have  been  incurred  if  the  gamma  model  had 
been  used  in  planning.  This  research  could  be  approached 
by  a cost  analysis  of  one  or  more  of  the  three  IMUs,  and 
then  by  the  use  of  computer  simulation  to  complete  the 
analysis . 

2.  An  investigation  should  be  initiated  to  discover 
why  "distribution  shifts"  occur  between  failure  cycles  for 
an  IMU.  One  approach  would  be  to  obtain  more  failure  data 
for  these  IMUs,  whenever  they  become  available,  and  ana- 
lyze these  data.  If  the  "distribution  shifts"  remain,  more 
detailed  failure  data  from  cards  2-5  of  the  G078C  Data 
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Collection  Systen  could  be  obtained  and  analyzed  as  a means 
of  trying  to  identify  specific  sub-unit  failure  patterns. 
These  patterns  may  be  influential  in  causing  the  observed 
"distribution  shifts"  in  the  IMUs. 

3.  Since  research  builds  on  research  to  provide 
insights  into  reliability,  it  is  recommended  that  another 
set  of  IMU  failure  data  be  collected  and  analyzed  to  (a) 
replicate  this  thesis  and  (b)  determine  if  a generalization 
concerning  the  failure  patterns  for  inertial  measurement 
units  can  be  made. 

4.  A recommendation  is  made  to  investigate  any 
change  in  the  distribution  for  the  FLIP  unit  after  failure 
data  has  been  collected  on  the  unit's  modification  which 
began  in  May  1976.  If  the  underlying  failure  distribution 
does  change  as  a result  of  the  modification,  the  infor- 
m.ation  would  prove  valuable  to  future  modifications  of 
similar  IMUs. 

5.  Future  research  is  recommended  to  determine  if 
other  electro-mechanical  units  experience  the  same  high 
infant  m.ortality  after  renewal  as  that  which  is  experi- 
enced by  the  IMUs  in  this  research. 

6.  A research  effort,  similar  to  the  one  performed 
by  this  thesis,  is  recommended  to  establish  if  support  for 
the  research  hypothesis  can  be  achieved  using  non-aggre- 
gated  data  of  the  G097C  Data  Collection  System  (16)  . 
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7.  Another  recomm.endaticn  for  future  research 
entails  the  use  of  other  distributions  than  those  contained 
within  SIMFIT  to  deternine  if  the  failure  data  for  the 
three  IMUs  can  be  better  modeled  \vith  a distribution  otner 
than  the  gamma. 
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appendix  a 

APPLICABLE  AIR  FORCE 
RELIABILITY  DOCUMENTS 
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Air  Force  Regulation  80-5,  Reliability  and  Maintainability 
Programs  for  Systems,  Subsystems,  and  Equipment  and 
Munitions . 

MIL-HDBK-217B , Reliability  Prediction  of  Electronic 
Equipment. 

MIL-STD-1050 , Sampling  Procedures  and  Tables  for  Inspection 
by  Attributes. 

MIL-STD-414,  Sampling  Procedures  and  Tables  for  Inspection 
by  Variables  for  Percent  Defective. 

MIL-STD-690A,  Failure  Rate  Sampling  Plans  and  Procedures. 

MIL-STD-721B , Definition  of  Effectiveness  Terms  for  Relia- 
bility, Maintainability,  Human  Factors,  and  Safety. 

MIL-STD-756A,  Reliability  Prediction. 

MIL-STD-757,  Reliability  Evaluation  from  Demonstration  Data. 

MIL-STD-781B,  Reliability  Tests;  Exponential  Distribution.  l' 

MIL-STD-785,  Requirements  for  Military  Programs  (for  Systems 

and  Equipment) . j I 

il 

MIL-STD-790A,  Reliability  Assurance  Programs  for  Electronic 
Parts  Specification. 

MIL-STD-1235 , Sampling  Procedures  and  Tables  for  Continuous 
Inspection  by  Attributes. 

TR-7,  Factors  and  Procedures  for  Applying  MIL-STD-105D 
Sampling  Plans  to  Life  and  Reliability  Testing. 
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The  SIMFIT  computer  program  is  used  to  test  a dis- 
tribution to  determine  if  a given  sample  follows  a partic- 
ular theoretical  or  hypothesized  distribution.  The  SIMFIT 
program  calculates  the  parameters  needed  to  determine  the 
probability  distribution  from  the  input  data.  The  input 
data  are  divided  into  cells  and  compared  to  the  theoretical 
value  for  each  cell.  Both  the  nonparametric  K-S  one  sample 
test  and  the  Chi-square  test  are  used  to  test  the  data  at  a 
given  confidence  interval.  The  SIMFIT  program  has  the  capa- 
bility of  testing  the  distribution  at  a = .1,  .5,  or  .01 
(29)  . 

The  K-S  analysis  in  the  SIMFIT  program  is  capable 
of  testing  one  or  all  of  the  twelve  distributions  contained 
within  the  SIMFIT  repertory.  These  distributions  are: 
Erlang,  normal,  lognormal,  gamma,  Pearson  XI,  Weibull, 
uniform,  beta,  triangular,  Poisson,  negative  binomial,  and 
positive  binomial.  The  Erlang  with  parameter  k = 1 is  the 
equivalent  to  the  exponential  distribution  and  the  SIMFIT 
program  was  used  to  test  this  distribution  as  well  as  the 
Erlang,  normal,  lognormal,  gamma,  Pearson  XI,  Weibull,  beta, 
and  negative  binomial  in  this  research  analysis. 

SIMFIT  presents  a Chi-square  analysis  in  addition 
to  the  K-S  analysis.  The  Chi-square  statistic  is  only 

computed  for  that  portion  of  the  distribution  in  which  the 
cell  size  criteria  is  met. 
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The  SIMFIT  program  also  constructs  a histogram  from 
the  input  data.  The  theoretical  distribution  is  compared  to 
the  input  data  on  this  histogram.  The  user  is  required  to 
input  the  scale,  minimum  data  value,  width  of  each  cell,  and 
the  number  of  cells  in  order  to  effectively  use  the  histo- 
gram. To  obtain  the  best  results  these  values  were  approx- 
imated by  the  following  formulas : 

Number  of  Cells  = (number  of  failure  data) (.3)  + .5. 

Range  = maximum  value  of  data — minimum  value  of  data. 
Width  of  Cell  = range/number  of  cells. 

If  these  formulas  are  not  used  and  some  other  values  are  in- 
put for  the  histogram,  then  the  computer  program  will  cal- 
culate these  values  using  the  above  equations  and  suggest 
that  these  calculated  values  be  used  if  the  sample  data  does 
not  fit  the  original  probability  function  (30:17). 

The  SIMFIT  program  gives  the  user  the  option  of 
specifying  the  parameters  of  the  distribution  under  test, 
instead  of  using  the  parameters  calculated  from  the  data. 
This  option  does  not  assure  the  user  of  making  the  correct 
decision,  but  the  capability  of  specifying  parameters  allows 
the  user  to  reject  a distribution  because  of  incorrectly 
given  or  assumed  parameter (s) . 

The  SIMFIT  program  recently  received  an  update  by 
the  rAnd  Corporation.  Further  anticipated  future  updates  of 
additional  distribution  capability  will  make  the  SIMFIT 
program  an  even  more  effective  tool  for  the  analysis  of  raw 
or  simulated  data. 
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APPENDIX  C 

A DISCUSSION  OF  THE  KOLMOGOROV- 
SMIRNOV  AND  CHI-SQUARE  GOODNESS- 
OF-FIT  TESTS 
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NONPARAI'IETRIC  METHODS 


r ^ 


The  two  general  classes  of  significance  tests  are 
the  parametric  and  nonparametric.  Parametric  tests  are 
more  powerful  and  are  generally  the  tests  used  if  the  asso- 
ciated assumptions  are  reasonably  met.  These  assumptions 
include  - ^ . — 49 

1.  The  observations  must  be  independent.  That  is, 
the  selection  of  any  one  case  should  not  affect  the 
chances  for  any  other  case  to  be  included  in  the 
sample . 

2.  The  observations  should  be  draim  from  normally 
distributed  populations. 

3.  These  populations  should  have  equal  variances. 

4.  The  measurement  scales  should  be  at  least 
interval  [9:380]. 

Nonparametric  tests  have  fewer  assumptions.  Prob- 
ably the  most  important  attribute  is  the  lack  of  a need  to 
assvime  that  the  specified  population  is  normally  distributed. 
Also,  in  nonparametric  tests  the  equal  variance  assumption 
is  unnecessary.  The  remaining  two  assumptions  are  of 
varying  importance  in  nonparametric  testing,  depending  upon 
which  test  is  selected  for  the  given  circumstances.  The 
nonpa'rametric  statistical  test  does  not  require  the  model 
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used  to  specify  conditions  about  the  "parameters"  of  the 
population  from  which  it  has  been  drawn. 

Normally,  the  fewer  and  weaker  assumptions  that 
constitute  a given  model,  the  more  general  are  any  con- 
clusions derived  from  the  application  of  the  statistical 
test  associated  with  that  model,  but  the  less  powerful  is 
the  test  of  the  null  hypothesis,  A statistical  test  is 

a good  one,  if  it  has  a small  probability  of  rejecting 
when  is  true  (small  a error) , but  a large  probability  of 
rejecting  when  is  false  (1-S) , which  is  sometimes 
called  the  power  of  a test  (27:18-20).  This  can  more 
readily  be  visualized  by  the  following  illustration. 


State  of  Nature 

Decision 

is  true 

is  false 

Accept 

Correct  Decision 

Tyoe  II  Error 
(S) 

Reject 

Type  1 Error 
(a) 

Correct  Decision 
(1-6)  — 

Power 


The  parametric  tests  have  greater  power  efficiency 
when  their  use  is  appropriate,  although  some  nonparametric 
tests  achieve  a power  efficiency  as  high  as  95  per  cent. 
This,  in  essence,  means  that  the  nonparametric  test  can 
provide  the  same  statistical  testing  power  with  a sample  of 
100  as  a parametric  test  with  a sample  of  95  (9:380). 
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If  the  underlying  distribution  of  given  observations 


of  failure  times  can  be  determined,  more  precise  estimates 
of  reliability  parameters  can  be  obtained.  The  tools  used 
to  do  this  statistically  are  called  goodness-of-fit  tests. 

Several  goodness-of-fit  tests  can  be  applied  to  test  a set 

of  failure  times  for  determining  whether  a hypothesized  | 

distribution  is  a reasonable  model  or  algorithm  (30;F-4). 

Two  jion^:^:netric  goodness-of-fit  tests  used  by  the  SIMFIT 
computer  program  are  the  Kolmogorov-Smirnov  and  the  Chi- 
square  goodness-of-fit  tests. 

KOLMOGOROV-SMIRNOV  TEST 

The  Kolmogorov-Smirnov  (K-S)  test  is  a nonparametric 
goodness-of-fit  test  used  to  test  a sample  to  determine  if 
it  could  have  been  selected  from  a particular  population 
distribution.  The  importance  of  the  K-S  test  to  reliability 
is  in  knowing  the  distribution  of  random  variables  (failures 
or  failure  times)  in  order  to  apply  the  appropriate  mathe- 
matical models  for  analysis  of  "stochastic"  systems 
(24:209) . 

The  development  of  the  K-S  statistic  is  relatively 
straightforward.  Assume  that  a given  population  has  a cumu- 
lative distribution  function  F(x)  and  that  a random  sample 
^1  — ^2  1 ^3  — * ’ * — ^n  size  n has  been  drawn  from  the 
population.  The  empirical  cumulative  distribution  function 
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(x)  is  then  defined  as 


# 


F„(x) 


for  X £ 
for  X,  < X 
for  5^  X 


i ^k+1 


where  k is  an  integer  element  between  0 and  n. 


The  distance -between-  the  -two  -functions  is  used,  as  a measure  . 
of  the  goodness-of-fit . The  maximum  vertical  deviation  is 
expressed  as 


D 


max 


— «o<  X<+‘“ 


F (x) 


F{x)  ! 


Kolmogorov  and  Smirnov  (1;  2;  23)  have  shown  that 

this  vertical  deviation  D is  a distribution-free  statistic; 

i.e.,  D is  a random  variable  whose  distribution  function  is 

independent  of  the  underlying  distribution  function  F(x). 

This  distribution-free  statistic  is  the  reason  that  the  K-S 

test  for  goodnsss-of-f it  is  used  by  the  SIMFIT  program.  The 

cumulative  distribution  function,  K (x) , is  expressed  by 

n 

Kn  (x)  = P (D  _<  X) 


is  computed  as  P(D  £ x)  = (1  - a)  for  n ^ 35  and  as  an 

I 

approximation  for  n > 35  by 


lim  K 

n-><» 


n 


(X) 

VH 


= K(x) 


where  K(x)  is  independent  of  n. 
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From  these  computations  the  fixed  maximum  vertical 
deviation  k between  the  theoretical  and  the  empirical  cumu- 
lative distribution  function  can  '^e  determined  for  a given 
sample  size  n and  a confidence  level  of  (1  - a)  x 100%, 
where 

p[d  < k]  = (1  - a) 

If  |P^(x)  - F(x)|  _<  k for  all  values  of  x,  the  hypothesis 
of  F(x)  being  the  underlying  distribution  function  has  to 
be  accepted  with  (1  - a)  x 100%  confidence,  otherwise  the 
hypothesis  must  be  rejected. 

Characteristics : 

Sample  size:  Ordinal  or  higher  data:  N ^ 1. 

Considerations:  Tests  for  continuous  distributions. 

Parameters  of  F(x)  are  assumed  or  known  and 
form  part  of  the  null  hypothesis. 

Test  Statistic:  D = 1 F^  (x)  - F(x)] 

Testing  Process: 

1.  Hypothesis  Statements 

Null  Hq:x  ~ the  hypothesized  distri- 

bution with  the  desired 
parameter (s) . 

Alternate  Hj^:x  / the  hypothesized  distri- 
bution with  the  desired 
parameter (s) . 

2.  Statistical  Test 
K-S  one  sample  test 
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3.  Significance  Level 


Desired  level  of  a and  N (sample  size) 

4.  Calculated  Value 
Computed  D statistic 

5.  Critical  Test  Value 

Enter  the  critical  values  of  D in  the  K-S 
one  sample  test  for  the  given  a,  N.  If 
grouped  data  is  used,  use  total  N for 
critical  values. 

6.  Decision 

If  the  calculated  value  of  D is  greater 
than  the  critical  value  of  D,  reject  the 
null  hypothesis  at  that  level  of  a. 

If  the  null  hypothesis  is  rejected,  reject 
the  hypothesized  distribution  and  the 
inputed  parameters. 

CHI-SQUARE  (x^)  TEST 

Probably  the  most  widely  used  nonparametric  test  of 
significance  is  the  Chi-square  test.  The  goodness-of-f it 
technique  is  a test  for  significant  differences  between  the 
observed  distribution  of  data  among  categories  and  the 
expected  distribution  based  upon  the  null  hypothesis  (5:371). 

In  the  Chi-square  one  sam.ple  case,  a null  hypothesis 
is  established  from  which  is  deduced  the  expected  frequency 
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of  objects  in  each  category.  Then,  the  deviations  of  the 
actual  frequencies  per  category  are  compared  with  the  hypoth- 
esized frequencies.  The  greater  the  difference  between  them, 
the  less  probable  that  the  differences  can  be  attributed  to 
chance.  The  measure  of  the  extent  of  this  difference  is 
the  value  of  Chi-square.  The  divergence  and  the  Chi-square 

value  are  directly  proportional. 

2 

The  formula  for  the  x test  statistic  is 


= I 


i=l 


(Qj  - ^i^ 

E. 

1 


where  0^  = observed  number  of  cases  categorized  in  the  i 
category. 

= expected  number  of  cases  categorized  in  the  i^^‘ 
category. 

k = the  number  of  categories. 

There  is  a different  distribution  for  x each 

number  of  degrees  of  freedom.  And  depending  upon  the  number 

of  degrees  of  freedom  (df ) , the  size  of  each  cell  must  be 

2 

large  enough  to  make  the  x test  appropriate.  If  df  = 1, 
each  expected  frequency  must  be  at  least  5 in  size.  If 
df  > 1,  80%  of  the  expected  frequencies  should  be  at  least  5. 
Expeqted  frequencies  of  one  or  more  cells  can  be  combined 
to  meet  this  criteria,  but  with  a loss  of  information 
contained  in  the  data  (9:382-3). 
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Characteristics : 


Sample  size:  Large  (n  ^ 35) ; nominal  or  higher  data. 

Considerations:  Tests  for  both  continuous  and  dis- 

crete distributions. 


Parameters  may  be  given  or  estimated  from  the 
sample;  if  estimated,  one  df  is  lost  per  esti- 
mated parameter. 

Test  Statistic: 


i=l 


Testing  Process: 

1.  Hypothesis  Statement 

Null  H :x  ~ the  hypothesized  distri- 

° bution  with  the  specified 

parameter (s) 

Alternate  H :x  / the  hypothesized  distri- 
^ bution  with  the  (desired) 

parameter (s) . 

2.  Statistical  Test 
Goodness-of-fit  test. 

3.  Significance  Level 

Desired  level  of  a and  df  = N-l-(#  of 
estimated  parameters) . 

4.  Calculated  Value 

2 

X -statistic 

5.  Critical  Test  Value 

2 

Obtained  from  x table  with  appropriate  a 
and  df. 
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6. 


Decision 


2 

If  the  calculated  value  of  x is  greater 

2 

than  the  critical  value  of  x > reject  the 
null  hypothesis  at  that  level  of  a. 

If  the  null  hypothesis  cannot  be  rejected, 
sufficient  evidence  to  reject  the  hypoth- 
esized underlying  distribution  is  lacking 
(30:F-14) . 

2 

COMPARISON  OF  THE  K-S  AND  x GOODNESS-OF-FIT  TESTS 


The  Kolmogorov-Smirnov  one-sample  test  treats  indi- 
vidual observations  separately  and  thus,  unlike  the  x^ 
test  for  one  sample,  need  not  lose  information  through 
the  combining  of  categories.  When  samples  are  small, 
and  therefore  adjacent  categories  must  be  combined  before 
X^  may  properly  be  computed,  the  x^  test  is  definitely 
less  powerful  than  the  K-S  test.  Moreover,  for  very 
small  samples  the  x^  test  is  not  applicable  at  all,  but 
the  K-S  test  is.  These  facts  suggest  that  the  K-S  test 
may  be  in  all  cases  more  powerful  than  its  alternative, 
the  X test  [_27:5l]. 


The  assumption  of  continuity  of  the  cumulative 
distribution  is  one  that  is  never  met  in  practice — it  is 
only  approximated.  As  a result,  the  K-S  test  tends  to  be 


conservative.  In  the  practical  research  setting,  the  proba- 
bility of  rejecting  a true  null  hypothesis  is  likely  to  be 


somewhat  smaller  than  the  level  of  significance  specified 
by  the  researcher.  This  problem  may  be  intensified  by 
grouping  the  data  (25:210). 


Ordinarily,  when  all  of  the  assumptions  are  met  and 
the  statistician  has  a choice  between  two  statistical 
tests,  the  more  powerful  of  the  two  will  be  the  one  with 
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the  most  rigorous  assumptions.  Thus,  for  a given  sample 
size,  one  v/ould  anticipate  that  the  Kolmogorov  statistic 
would  be  superior  to  the  Chi-square  when  the  data  are 
continuously  distributed.  However,  the  Kolmogorov  sta- 
tistic suffers  great  loss  of  power  if  this  assumption  is 
violated,  and  this  is  quite  likely  to  be  the  case  in  the 
practical  research  setting.  Some  experimentation  on  the 
part  of  the  investigator  may  be  necessary  to  determine 
which  of  the  two  tests  is  most  appropriate  for  his  data 
[25:212-3]. 

A satisfactory  theory  of  'best  tests'  has  not  yet 
been  developed  for  nonparametric  methods;  therefore  it 
is  necessary  to  rely  heavily  on  intuition,  and  attempt 
to  show  that  the  nonparametric  test  selected  is  superior 
to  other  available  tests  of  this  type  for  the  problem 
being  considered  |_15;327]. 

For  this  reason  the  research  effort  accepted  as 

passing  any  distribution  which  passed  either  the  K-S  test 

2 

or  the  X test  or  both. 
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STATISTICAL  DISTRIBUTIONS 
USED  WITHIN  THE  THESIS 
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EXPONENTIAL  DISTRIBUTION 

The  exponential  distribution  model  is  often  used  in 
reliability.  "For  a good  many  years,  reliability  analysis 
was  almost  wholly  concerned  with  constant  hazard  rates 
L26;185]."  A constant  hazard  rate  implies  an  exponential 
density  function  and  reliability  function.  One  important 
aspect  of  an  exponential  density  function  is  that  it  is 
both  a special  case  of  a gamma  density  function  (26:54)  and 
a special  case  of  a Weibull  density  function  (26:49). 

Let  the  continuous  random  variable  t be  the  time  to 
failure  or  the  time  between  failures.  Then  the  resulting 
distribution 


f(t)  = X exp(-  Xt)  , t ^ 0 
f(t)  =0  t < 0 

is  the  exponential  distribution,  where  X = H(t),  the  hazard 
function. 

The  expected  value  of  the  exponential  is 

“ 1 
E(t)  = / Xt  exp  [-  Xt]dt  = = 0 

o 

and  the  variance  of  the  exponential  is 

V(t)  = E(t)^  - [E(t)]^  = = 0^ 

where  0 is  the  mean  time  to  failure. 


i 

i 

i 
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The  reliability  function  for  the  exponential  is 


R (t)  = exp (-  At) . 

The  cumulative  distribution  function  is 
F(t)  = 1 - expi_-  At ] . 

The  hazard  rate  for  the  exponential  is 

H(t)  = = A exp[-  At]  ^ 

exp|_-  At] 

which  implies  that  the  exponential  distribution  only  applies 
if  the  failure  (hazard)  rate  remains  constant  with  age;  i.e., 
the  failure  probability  in  any  time  period  remains  constant 
throughout  the  unit's  lifetime  (26:185),  In  other  words, 
the  exponential  distribution  is  applicable  as  a reliability 
model  for  failure  times  only  if  the  failure  rate  is  constant 
over  time  (31) . 

In  considering  the  exponential  distribution  model 
for  describing  failures  of  units  in  a renewal  environment, 
the  "complete  lack  of  memory"  property  characterizes  the 
exponential  distribution. 

P{X  >r+s|x>r}=  P{x  > s} 

where' r and  s are  any  positive  numbers  (12:156). 

This  means  that  P{x>s}  is  independent  of  r.  In 
other  words,  if  a piece  of  equipment  has  not  failed 
during  r time  units  its  conditional  probability  of 
serving  r+s  or  more  time  units  is  independent  of  r 
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and  is  equal  to  the  probability  of  serving  s or  more 
time  units.  Stated  differently,  if  time  to  failure  of 
a piece  of  equipment  follows  the  exponential  distri- 
bution, then  aging  of  the  equipment  is  immaterial 
[12:157] . 

0,  the  MTTF,  is  also  a popular  measure  of  relia- 
bility. 

The  MTBF  has  meaning  only  when  one  is  discussing  a 
renewal  situation,  where  there  is  repair  or  replacement. 
. . . Unfortunately  these  two  quantities  i_MTTF  & MTBFJ 
are  sometimes  wrongly  thought  of  as  equivalent,  probably 
because  for  certain  simple  constant-hazard  cases  they 
are  equal.  In  a single-parameter  distribution,  speci- 
fication of  the  MTTF  fixes  the  parameter  i_26  :197]. 

In  a multiple  parameter  distribution,  such  as  the 
Weibull  or  the  gamma,  the  MTTF  places  only  one  constraint 
on  the  model's  parameters. 


WEIBULL  DISTRIBUTION 


In  reliability  applications,  the  Weibull  distri- 
bution is  a general  two-parameter  distribution.  In  recent 
years  many  reliability  applications  have  been  found  for 
this  distribution.  The  density  function  for  the  VJeibull  is 

m f -kt  ) 
f(t)  = kt  exp  — 

, (m+1)  . 


where  m is  the  shaping  parameter  and  k is  the  scaling 
parameter. 

The  Weibull  model  may  represent  not  only  a constant 
hazard  rate  but  also  an  increasing  hazard  rate  and  a type 
of  decreasing  hazard  rate  by  appropriate  selection  of  the 
model  parameters  (26:190). 
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The  cumulative  distribution  function  is 


F(t)  = 1 - exp 


(m+1) 


The  hazard  model  is 


H(t)  = kt‘ 


m 


for  m > - 1. 


The  reliability  function  for  the  Weibull  is 

(m+1)  T 


R(t)  = exp 


-kt 


(m+1) 


In  the  Weibull  model,  if  m = 0,  the  resulting 
distribution  is  the  exponential  distribution.  Therefore, 
the  exponential  distribution  can  be  thought  as  a special 
case  of  the  VJeibull  distribution  (as  previously  stated)  or, 
alternatively,  the  Weibull  distribution  can  be  thought  of 
as  a more  general  case  of  the  exponential  distribution. 

The  expected  value  of  the  Weibull  distribution  is 


, . > lx  ,n+lxm+l 


where  T (gamma) (n)  = / exp(“t)(t)^  ^ dt  = (n-l)I  and  n is 

o 

real  and  positive. 

< The  variance  of  the  Weibull  distribution  is 
2 


V(t)  = 


m+1 

k 


m+ 1 


I '’(1  ^ dr)  - (HI  + 


•)) 
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The  Weibull  distribution  has  two  drawbacks.  First, 
the  model  is  a two  parameter  model,  which  means  that  there 
exists  increased  difficulty  in  estimating  the  parameters. 

But  with  modern  computer  applicability  and  expertise,  this 
drawback  should  be  minor.  The  second  drawback  is  within 
the  model  itself.  The  Weibull  model  cannot  accurately 
represent  the  linearly  decreasing  hazard  rate,  which  is 
useful  in  describing  early  failures.  "However,  with  an 
appropriate  choice  of  k and  m one  should  be  able  to  mini- 
mize this  effect  [26:190l." 

GAMMA  DISTRIBUTION 

Another  distribution  used  in  reliability  is  the 
two-parameter  gamma  distribution.  The  gamma  density 
function  is 

f (t)  = L3  r (a  + 1)3  t expL“  gJ 

where  a > 1,  3 > 0,  0 ^ t £ »,  T (x)  = (x  - 1)!  when  x is  an 
integer.  a is  the  shaping  factor  or  parameter  and  a change 
in  a will  change  the  shape  of  the  curve.  6 is  the  scaling 
parameter  and  will  change  the  vertical  and  horizontal  scales 
(26:54) . 

' The  gamma  tends  to  be  the  key  model  for  a standby 
system. 

If  we  let  t^,  t2»  . . . , tj^  be  the  failure  times 
of  the  on  line  component  and  the  n-1  standby  com.ponents , 
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then  the  failure  time  of  the  system  tg  = tj^  + t2  ... 

The  most  common  model  to  choose  for  each  time-to-f ailure 
distribution  is  the  exponential  ^26: 54], 

If  a = 0,  the  exponential  distribution  is  obtained  from  the 

gamma  distribution  with  1/B  = X.  If  the  ct  paraimeter  is 

defined  to  be  strictly  positive,  the  Erlang  is  obtained  from 

the  gamma  model.  Therefore,  the  gamma  distribution  is  a 

general  case  of  both  the  exponential  distribution  and  the 

Erlang  distribution. 

The  gamma  cumulative  distribution  is 


F(t) 


1 


■ a 

I 


L k=0 


t 

S ' 


where  a is  a positive  integer. 

The  gamma  reliability  function  is 

k T 


R(t)  = 


a 


y — — ( — ) 

l k! 
k=0 


t 

exp  - f 


The  expected  value  of  the  gamma  is 


E(t)  = S (a  + 1)  . 

The  variance  of  the  gamma  distribution  is 
V(t)  = E(t)[s]  = B^(a  + 1)  . 
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ERLANG  DISTRIBUTION 


The  density  function  for  the  Erlang  distribution  is 


f (t)  = t*^  ^ exp[_-  kyt]  for  t ^ 0, 

(k-1)! 

where  u and  k are  strictly  positive  parameters  of  the  distri- 
bution and  k is  restricted  to  being  integer  (14:417).  In 
reliability,  yk  is  set  equal  to  X,  where  X is  strictly  a 
positive  integer  and  the  Erlang  density  function  is 

;yk^k-l 

f(t)  = exoL"  Xt]. 

(k-1)! 


The  Erlang  distribution  is  a special  case  of  the 
gamma  distribution  in  which  the  k param.eter  is  defined  to 
be  strictly  positive  integer.  In  addition,  the  Erlang  dis- 
tribution with  k = 1;  i.e.,  a first-order  Erlang  distri- 
bution, is  equivalent  to  the  exponential  distribution 
(13:159)  . 

The  Erlang  distribution  is  a very  important  distri- 
bution in  reliability.  Suppose  that  ^2’  * * * ' ^n 

n independent  variables  with  an  identical  exponential  dis- 
tribution whose  mean  is  X.  Then  their  sum, 


T - tj^  + t2  + • • • + 

has  an  Erlang  distribution  with  parameters  k and  X/k 
(14:417).  In  reliability  theory,  the  Erlang  distribution 
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is  used  most  extensively  in  studying  standby  systems.  The 
above  development  (called  a convolution)  provides  the  reli- 
ability analyst  with  a means  to  assess  the  applicability  of 
the  exponential  distribution.  If^the  exponential  distri- 
bution is  convoluted  over  seve^^l  renewals,  the  result  is 
an  Erlang  distribution. 

The  expected  value  of  the  Erlang  is 

% 

E(t)  = t/X 

and  the  variance  of  the  Erlang  is 

V(t)  = t/X^. 

NORMAL  DISTRIBUTION 

The  normal,  or  Gaussian,  distribution  is  a well 
known  two-parameter  distribution.  Shooman  showed  that  when 
a certain  parameter  which  is  a random  variable  is  the  sum 
of  many  other  random  variables,  the  parameter  will  have  a 
normal  distribution  in  most  cases  (26:50). 

The  density  function  for  the  normal  distribution  is 

2 

f(t.  ) = ^ ^ exp  - ^ , -00  < t < + “, 

a ■'/Ttt  2a^ 

where  the  parameters  are  u , the  mean  or  mathematical  expec- 
tation, and  a,  the  standard  deviation  (5:195).  y may  be 
any  constant  and  a must  be  positive.  The  density  function 
is  a bell-shaped  curve  that  is  symmetric  around  u (14:325). 
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The  cumulative  distribution  function  for  the  normal 


distribution  is 


F(tb)  = /_ 


VTiT 


exp 


(t  - u) 

2o2 


2 -1 


dt . 


The  expected  value  of  the  normal  distribution  is 


E(t)  = y. 


The  variance  of  the  normal  distribution  is 


V(t)  = 


The  reliability  of  the  normal  distribution  is  given 


by 


R(t.  ) = 1 - F(t.  ) = 1 - / 


-oo  O 


exp 


■ (t  - y)^ 1 


dt . 


LOGNORMAL  DISTRIBUTION 


A random  variable  t follows  a lognormal  distribution 
if  its  logarithm  (In  t)  is  normally  distributed  with  param- 
eters y and  c. 


The  density  function  for  the  lognormal  distribution 
is 


I 


The  variance  of  the  lognormal  distribution  is 


V(t)  = exp[_2(vi  + a )]  - exp(_2u  + 


(12:154-5) 


The  cumulative  distribution  function  is  given  by 


, , , , 1 ,L(ln  t-y)/a] 

F(t)  = P{T  < t}  = exp 


2 1 


dt. 


which  unfortunately  does  not  have  an  easily  integrated 
closed  form. 

The  reliability  function  of  this  distribution  is 
R(t)  = 1 - F(t)  . 

BETA  DISTRIBUTION 

The  beta  distribution  is  a continuous  distribution 
of  a random  variable  with  the  density  function  given  by 

,,t,  0 < t < 1, 

r(o)r(e) 

with  parameters  a and  B.  The  density  function  is  also  often 
found  in  the  form 


f(t)  = 


r (a+B+2) 


r (a+1) r (6+1) 


a B 

t (1-t)  , 0 < t < 1. 
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The  beta  distribution  in  reliability  finds  usage  in 
the  maintainability  area.  Special  cases  of  the  beta  include 
the  uniform  distribution,  the  student's  t-distribution , and 
the  F-distribution  (14:324-5). 

The  cumulative  distribution  of  the  beta  is 


F(t)  = / 


t r(a+3+2) 

° r(a+l)r(3+l) 


t“(l-t)^  dt,  0 < t < 1. 


The  expected  value  of  the  beta  distribution  is 


E(t)  = 


a+8+2  . 


The  variance  of  the  beta  distribution  is 


v(t)  = 


r(a+8+2) 2 (a+S  + 3)]  . 


The  reliability  function  of  the  beta  distribution 


R(t)  = 1 - 


r (ct+3+2) 

r (a+1) r (6+1) 


t ^ 

/ t'^  (1-t)  " dt,  0 < t < 1. 


NEGATIVE  BINOMIAL  DISTRIBUTION 


The  negative  binomial  distribution  is  also  called 
the  Pascal  distribution.  The  density  function  is  given  by 


' I w-k  k 

f (w)  = j (1-p)  p 


w = k , k + 1 , . 


where  w is  the  number  of  trials  considered  and  k is  the 
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number  of  successes  from  w that  is  required  in  a given 
situation.  The  probability  of  success  is  p. 

The  expected  value  of  the  negative  binomial  is 

E(w)  = k/p. 


The  variance  of  the  negative  binomial  is 
V(w)  = k(l-p)/p^. 


The  k^^  success  occurs  on  the  trial  only  if 

there  are  exactly  k-1  successes  in  the  preceding  w-1 
trials  and  a success  occurs  on  the  trial.  The 

probability  of  exactly  k-1  successes  in  w-1  trials  is 
given  by  the  binomial  distribution  model  . . . 
^20:112] . * 


*The  development  of  the  Pearson  XI  distribution  is 
beyond  the  scope  of  this  research  due  to  its  mathematical 
complexity. 
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